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THE EFFICACY OF HABIT TRAINING IN CHRONIC SCHIZOPHRENIA* 


A. R. MAY AND J. P. S. ROBERTSON 
Warlingham Park Hospi Netherne Hospital, 
urrey, Surrey, England 


PROBLEM 
The term habit training is applied in Great Britain to procedures aimed at the 
rehabilitation of severely disorganized schizophrenics. In the United States similar 
procedures are usually termed total push following Myerson“. A recent account of 
them is given by Sines, Lucero and Kamman“?. Though the procedures are widely 
practised in Great Britain little has been published concerning them. Details of 
their application show marked variation from one institution to another but the 
only available guide is the description of Symons“. The only experimental study of 
their effectiveness appears to be that of Bennett and Robertson“? which arrived at 
uncertain conclusions. Opinions continue to be divided on this issue of effectiveness. 
Some observers think that the procedures in question lead to genuine improvement 
in the habits of disorganized schizophrenics. Others believe that the apparent im- 
provement results merely from increased nursing care and stimulation, disappearing 
immediately that these are withdrawn. The present investigation was directed to 
determine the efficacy of habit training more conclusively than Bennett and Robert- 
son“) were able to do. * 


METHOD 


Period and Procedure of Training. Habit training was carried out over a period of 
six months following the routines described by Bennett and Robertson”? and Sy- 
mons‘), with emphasis placed on variety of occupation, increased social stimula- 
tion, and avoiding monotony. A full and varied programme of social activities and 
entertainments was arranged. Several activities were undertaken in conjunction 
with female patients of the same type. The experimental group were under the super- 
vision of the same nurses throughout. Besides conducting the ~ ~ial activities these 
gave careful attention to the dress and personal habits of the imental patients. 
They also operated a system of rewards for patients who u.a well in any respect. 
The control group were under the care of many different nurses, were provided with 
none of the special social activities of the experimental group, and received no con- 
centrated personal supervision. 


Patients Investigated. Random sampling numbers were used to select 10 patients for 
habit training from a ward of 54 male chronic schizophrenics. From the remaining 
patients in the ward a matched control group was drawn. The control group were 
selected to match the experimental group as closely as possible in distribution of 
ages, occupational levels before admission, and ratings on personal habits, contact 
with the environment, aggressiveness, and volitional behavior. The four ratings 
were made by four independent observers on constructed scales and the mean rating 
for each patient on each trait was calculated. Matching was possible with a con- 
siderable degree of closeness. Four additional characteristics which might differen- 
tiate the groups were not used for matching but were subsequently tested for statis- 
tical significance. These were: having previously undergone habit training; having 
previously had electric convulsive therapy; having undergone prefrontal lobotomy; 
and being diagnosed as mentally defective. The groups did not significantly differ 
in frequencies here. 


Techniques of Assessment. The experimental and control groups were each assessed 
by seven different techniques immediately before the period of training began and at 


*The authors’ thanks are due Dr. R. K. Freuden henna wetninee 
pital, for his encouragement of this investigation. berg, physi perin t, Netherne Hos- 
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its conclusion. Both sets of assessments were made after an interval of 48 hours 
during which the experimental and control groups were left to care for themselves 
without the intervention of nurses. Stringent supervision was exercised to ensure 
that the nurses did not in fact intervene during this period to improve personal 
appearance. 

The seven techniques of assessment were (a) weight, (b) scores on a test of 
simple information, (c) scores on a test composed of a series of simple perceptual- 
motor tasks, (d) ratings on social behavior while doing the two foregoing tests, (e) 
ratings on personal appearance as observed in the test situation, (f) ratings on per- 
formance in a group task, and (g) ratings on the Fergus Falls Behavior Scale. ®: * 

Weight was included because it is sometimes held that habit training leads to an 
increased intake of food in chronic schizophrenics. The simple information test 
comprised 10 questions such as “‘What is the name of this hospital?” In the simple 
tasks test, the patient stood by a table on which various articles were laid out in 
constant positions and was required to carry out 10 diverse actions with them such 
as to pick up a ruler, to pg « head with it, to point to something red, etc. 

The ratings on social behavior and personal appearance were made by three 
independent judges using a four-point scale. In social behavior nine separate aspects 
were rated and in personal appearance 12 aspects. The ratings were summed to 
yield a total score for each technique. In the group task the experimental and con- 
trol patients were each as groups required to remove bricks from a pile to an indi- 
cated spot 60 yards away. Each patient was rated individually in regard to his per- 
formance on the task by five independent observers using a four-point scale applied 
to five different aspects. The ratings were summed to yield a total score. The judges 
on social behavior and personal appearance and those on the group task were all 
unaware which patients were in the experimental and which in the control group. 

The Fergus Falls ratings were made at the beginning and end of the training 
period by the charge nurses of the ward, who possessed knowledge as to which were 
experimental and which control subjects. The charge nurses also made these ratings 
on the remaining patients in the ward. Weekly Fergus Falls ratings were available 
for the experimental group only, made by the nurses supervising them. 


Analysis. Statistical comparisons were made of (a) initial assessments of experi- 
mental versus initial assessments of control group (t-tests); (b) final assessments of 
experimental versus final assessments of control yee (t-tests); (c) initial versus 
final assessments of experimental group (t-tests); (d) initial versus final assessments 
of control group (t-tests) ; and (e) if the first comparison were statistically significant, 
final assessments of experimental versus final assessments of control group allowing 
for the regression of final on initial scores (analysis of covariance). 


REsvLtTs 
The results may be summarized thus, differences beyond the 5% level being 
termed significant and those beyond the 1% level very significant. 
(a) Initially, the groups differed natty in weight and Fergus Falls 
ratings but not in the other assessments. The experimental group were heavier and 
also better behaved according to the charge nurses’ opinions. 


(b) Finally, the experimental group showed very significantly higher ratings 
on ‘pmmme appearance and significantly higher Fergus Falls ratings than the con- 
trol group. 

(c) In comparing the final with the initial assessments of the rimental 
group there was a very significant loss in weight and there were very significant gains 
in personal appearance and Fergus Falls ratings. The weekly Fergus Falls ratings 
made by the supervising nurses also showed a very significant increase when the 
mean of the last five weeks was compared with that of the first five weeks. 


(d) The final compared with the initial assessments of the control group show- 
ed a very significant gain in Fergus Falls ratings. 
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(e) When the final assessments of the two groups were compared making 
allowance for the regression of final on initial scores, the experimental group dis- 
played a significant loss in weight but the Fergus Falls difference was no longer 
statistically significant. 


DIscussION 

The fact that there were no significant changes in the scores on simple informa- 
tion and simple tasks and the ratings on social behavior and the group task is an un- 
equivocal result. The evidence is strong that habit training, even in conditions of 
full and varied social stimulation, does not produce changes for the better in general 
alertness, practical efficiency or social adaptability. The significant change in ratings 
on personal appearance is also an unequivocal result. It definitely establishes that 
habit training can produce a change for the better in care of personal appearance 
which is sufficiently firm to survive 48 hours without aid or stimulation from nurses. 

The interpretation of changes in Fergus Falls ratings and weight is more doubt- 
ful. Data on both these assessments were available for the remaining patients in the 
ward, who were neither experimental nor control subjects. These indicated that all 
54 patients showed a significant improvement in Fergus Falls ratings at the end of the 
six months as compared with the beginning. The most obvious explanation is that 
the change is an artifact of rating due to some such phenomenon as central tendency, 
shift in adaptation level or familiarization and increased personal involvement with 
the patients. The remaining patients like the experimental group, showed a signi- 
ficant loss in weight; the deviant patients were the control group. The differences in 
weight must be ascribed to some other factor than habit training. 


SUMMARY 

The meaning of the term “habit training” in British usage is described. An 
investigation is reported which assessed the effects of habit training on chronic 
schizophrenics by comparing an experimental group of 10 patients with a matched 
control group. The methods of training and assessment are briefly outlined. The 
results indicated that the habit training of chronic schizophrenics does not produce 
changes for the better in general alertness, practical efficiency, or social adaptability 
but does produce an improvement in care of personal appearance which is sufficiently 
firm to survive 48 hours without aid or stimulation from nurses. 
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MAZE PERFORMANCE AND PRIVILEGES IN A 
PSYCHIATRIC HOSPITAL! 


JOSEPH KUNCE AND WAYNE ANDERSON 
Veterans Administration Hospital, Jefferson Barracks, Mo. 


PROBLEM 


This study specifically investigated the effectiveness of the Peters’ Circular 
Mazes for measuring the decision-making capacity and hence potential for self- 
management of psychiatric patients. Self-management was judged in terms of the 
ward staff’s decision to grant or to withhold privileges. The Peters’ Circular Maze 
Test (PCM) consists of a series of thirteen printed circular pencil mazes of increasing 
complexity. According to Peters®?, the only major factor present in maze threading 
is “the subject’s capacity to make choices between alternatives, under the guidance 
of a goal idea.” This decision-making capacity he feels is related to “common 
sense” or ability for self-management. The PCM has the advantages of being easy 
to administer and score, being relatively culture free, and only slightly influenced by 
the factor of chance. 


PROCEDURE 


The subjects were patients admitted for functional psychiatric conditions to 
Jefferson Barracks VA Hospital. To minimize the possible effects of aging upon test 
performance, patients over 50 were excluded. A few patients were excluded for 
failure to cooperate with testing procedures. The first sample of 117 subjects in- 
cluded all admissions to this hospital from May to the middle of August, 1959 who 
met our criteria. For a cross-validation of results, a second sample of 110 patients 
comprised of admissions from the latter date through November, 1959 was obtained. 
The ward staff was essentially constant during this period. The decision for transfer 
was in no case contingent upon knowledge of maze performance. 

The data collected on each subject included: (a) the standard score on either of 
two forms of the PCM, administered within a week after admission, (b) score on a 
vocabulary test administered at the same time, (c) admission diagnosis, and (d) 
residence on the thirty-first day after entry into the hospital, categorized as closed 
ward, privileged ward, or discharged from hospital. 

The relationship of maze performance and subsequent granting of privileges 
was evaluated by an analysis of variance computed upon the maze scores by pro- 
gress category (closed ward, privilege ward, and discharged). This procedure was 
carried out on the first sample and repeated upon the cross-validation sample. The 
same method was used to evaluate the relationship of vocabulary scores with pro- 
gress. Chi-square was utilized to evaluate the relationship between diagnosis and 
privileges for both samples. 


RESULTS 

The data collected were comparable for both samples. In the first sample the 
mean standard score for the maze test was 40.28 and for the second sample it was 
39.55. The mean raw scores on the vocabulary test were 25.15 and 24.83 respectively. 
In the first sample, 63% of the patients had been granted privileges after 30 days, 
and in the second sample 58% had received privileges. The per cent of patients in 
each sample with admission diagnosis of a non-psychotic nature was also similar, 
35% and 30% respectively. 

The mean standard score on the maze test was found to be significantly higher 
for those patients who had been granted privileges after thirty days than for those 
who remained on locked wards (Table 1). This finding was true for both samples 
(F values 7.24 and 8.71, both significant beyond the .001 level). 


1We wish to express our appreciation to Henry N. Peters for allowing us to use data from his files. 
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TABLE 1. Mean Maze AND VocaBULARY ScorEs AT ADMISSION CATEGORIZED ACCORDING TO 
Patients’ PLace oF ResipENceE 30 Days LaTEeR 


Sample Closed 


Ward 


Mean Standard 1 34.48 42.63 46.00 -001 
Scores for PCM 2 32.64 45.20 41.92 001 
Mean Raw Scores 1 24.71 25.49 25.25 
For Vocabulary 2 20.61 26.76 24.54 .05 


The scores of the admission vocabulary test did not produce consistent results. 
In the first sample, there was no significant difference among the mean vocabulary 
scores for the three groups of patients. In the second sample, however, the differ- 
ence among the means was significant at the .05 level with a F value of 3.99 (Table 1). 

Admission diagnosis also yielded inconsistent findings. The neurotics pro- 
gressed at a rate significantly greater than chance expectations based upon the 117 
subjects in the first sample (x? = 6.223, significant at the .05 level). The psychotics 
in this sample and the neurotics and psychotics in the second sample did not differ 
significantly from the appropriate chance expectations in rates of progress (Table 2). 
Taking samples 1 and 2 together the neurotics as a group progressed at a rate greater 
than chance (x? = 8.578, significant at the .02 level). 


Tasie 2. Privitece Status 30 Days Arrer ApMISSION OF PaTIENTS DIAGNO3ED NEUROTIC OR 
Psycnotic Upon ADMISSION 


Sam Admission Closed Discharged Level of 
Diagnostic Ward W: Significance 
Category N N 
1 Neurosis 8 20 ; 13 05 
Psychosis 36 28 12 


17 6 
34 


Because of the similarity between the first and second samples, a summary 
table (Table 3) was prepared combining both samples. In this table progress has 
been dichotomized into no progress (closed ward) and progress (privileged ward or 
discharged) since maze performance did not show consistent differences between 
privileges and discharge categories (Table 1). Table 3 illustrates the confidence one 
can place in predictions with respect to a given maze score as well as showing how 
scores are differentially related to progress according to admission diagnosis. 


3. Maze Scorzs, DraGNosis AND Procress. Nevrotics (Ne) N = 74; 
Psycuotics (Ps) N = 153; Tora (To) N = 227. 


Number 


Per Cent 
Progressing 


oo SER 


Privileged Discharged Level 
Ward Patients Signif. 
2 Neurosis 10 
Psychosis 36 
Maze Standard Number Not Bacay 
Score Progressing 
Ne Ps To Ne |_| To Ne Ps To 
0-9 1 6 7 0 1 0 14 12 
10-19 2 13 15 2 4 50 13 21 
20-29 6 11 17 10 18 62 42 51 
30-39 3 13 16 3 17 50 52 51 
40-49 6 15 21 16 40 73 61 66 
50-59 0 7 7 20 49 | 100 81 87 
60-69 0 7 7 5 8 | 100 30 53 
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Discussion 

It is apparent from the results that a patient’s performance on the PCM is 
related to those characteristics which the ward staff consider important in the 
granting of privileges. It should also be noted that the mean standard score for the 
maze in both samples was approximately 40, or one standard deviation below that 
of normals, a finding consistent with that of Peters“. These results suggest that 
the use of mazes as a measure of social-psychological functioning may have general 
applicability. Traditional intelligence tests are not likely to be as sensitive mea- 
sures of this functioning, as demonstrated by our finding that vocabulary perform- 
ance was not consistently related to progress. 

It was noted that the psychotics who earned scores over 60 (Table 3) failed to 
progress at the expected rate. Examination of the records of these seven individuals 
revealed that three of them were transferred to a privileged ward several days after 
the thirty-day cutting criterion. The remaining cases had records of aggressive be- 
havior. As shown in Table 1 there was a lack of significant differences on the data 
for patients on privileged wards and those discharged. One possible reason for this 
is that given a certain degree of emotional stability, environmental factors such as 
family acceptance or the availability of employment may become more important 
with respect to discharge than the individual’s psychiatric condition. 


SUMMARY 

This study investigated the relationship between a patient’s ability to make 
adequate choices between alternatives as measured by a maze test and the granting 
of privileges by the hospital staff. Data were collected for two groups of NP patients 
totaling 227 cases on: Peters’ Circular Pencil Maze Scores, vocabulary test score, 
diagnosis, and place of residence at the end of thirty days. Maze performance was 
found to be related to progress at the .001 level. The mazes appear to be a useful 
technique for screening patients for early discharge from an institution. 
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VERBAL MANIPULATION IN A PSYCHOTHERAPEUTIC 
RELATIONSHIP! 


HENRY C. RICKARD, PATRICK J. DIGNAM”? AND ROBERT F. HORNER 
Veterans Administration Hospital, Tuscaloosa, Alabama 


PROBLEM 

Numerous verbal conditioning studies have been reported in the literature and 
excellent reviews of the area are available®: *». The writers have been unable to 
locate, however, a single study reporting verbal conditioning in an actual thera- 
peutic treatment case. This paper reports the manipulation of verbal behavior in a 
60 year old male who has been hospitalized continuously for over twenty years. The 
patient was verbose, expressing freely delusions of grandeur and persecution. It was 
decided that rational speech would be designated as the dependent variable to be 


— is extended to Dr. M. Dinoff and Dr. E. O. Timmons for critical readings of the 
manusc 


*Nor ow A Florida State Hospital, Hollywood, Florida. 
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increased while the verbalization of delusions would be ignored or mildly punished. 
In other words, a direct attempt was made to reduce a specific class of deviant 
verbal behavior; no attempt was made to cope with underlying attitudes, dynamics, 
or feelings. 


PROCEDURE 


Three different Zs worked with the S in the experiments to be described. In 
Experiment I the patient was seen for 35 sessions each of which was 45 minutes in 
duration. Sixteen of the sessions were tape recorded and later timed by Z 1 for 
minutes of rational speech. The delusional systems in the S were easily identified.’ 
E 1 turned away from the S, gazed at the floor, looked out the window, etc., while 
the S’s speech was delusional. This procedure was operationally defined as mild 
“punishment.” EZ 1 reinforced all rational verbalizations at a rate of from six to ten 
reinforcements per minute. Reinforcement was grossly defined as a smile, nod, ex- 
clamation expressing interest, etc. Since resistance to extinction of the response class 
‘rational verbalization’”’ was desired, an attempt was made to place the response 
under a lower partial reinforcement schedule during the last 5 sessions. Reinforce- 
ment was cut in frequency from approximately 8 per minute to less than 1 per 
minute. 

In Experiment II the same S was used and, again, rational verbalization was the 
class of behavior to be reinforced. At this point a second experimenter (EZ 2) saw 
the patient once a week for approximately 6 additional months and followed the 
same conditioning procedure used in Experiment I. No attempt was made to record 
the data during this period. After 6 months, delusional material was still prominent 
in the patient’s verbalizations, but contrary to earlier sessions it seemed evident 
that the delusional stream could be turned on and off by experimental manipulation. 
To test this hypothesis the patient was subjected to alternating 10 minute periods of 
“minimal reinforcement”’ and ‘maximal reinforcement.”” The former consisted of 
E 2 aperiodically directing his gaze toward the patient (who always maintained a 
steady flow of speech) at the rate of three times per minute, smiling, and saying 
“um-hum.” This “minimal reinforcement” was administered regardless of what the 
patient might be saying at the time and was designed to maintain rapport and keep 
the patient from becoming too uncomfortable. During ‘‘maximal’’ reinforcement, 
delusional verbalizations by the S were interrupted; the experimenter encouraged 
the expression of non-delusional material of known valence to the S. E 2 saw the S 
for four 30 minute sessions over a period of two weeks, during which the S was ex- 
ete to alternating periods of ‘“‘minimal’’ and “maximal” reinforcement. After the 
ourth session E 3 was introduced to the S and the above procedure replicated. 


REsULTS 


In Experiment I the patient initially gave approximately two minutes of 
rational speech per 45 minute session. By the 21st session this had jumped to ap- 
proximately 25 minutes of rational speech. The mean number of minutes of rational 
speech for the 21st through the 31st sessions was 30 minutes with a range from 8 
to 44 minutes. It should be noted that during the period of time that Experiment I 
was being carried out, ward personnel were instructed to essentially ignore the 
patient when he was delusional on the ward, but to elicit and reinforce rational 
speech through questions, smiles, etc. It was hoped that this procedure, which was 
very similar to that followed by EF 1, would promote maximal generalization from 
kt are ey to the ward environment. Generalization from the ward treatment 

ack to the psychotherapy sessions is an equally tenable hypothesis. At the begin- 
ning of the 31st session the S was put on a lowered schedule of reinforcement. For the 
remaining 5 sessions a sharp, progressive decrement in amount of rational speech 
resulted; an average of only 7 minutes of rational speech occurred during the last 


‘Examples of delusional speech were as follows: “I have a fractured head and a broken nose be- 
cause of spinal pressure.” “Stars have metal bottoms and exert a magnetic pressure on the earth.” 
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two 45 minute sessions. The rapid drop in rational speech when a lower partial re- 
inforcement schedule was attempted, the fluctuating nature of the dependent var- 
iable, and the fact that a full session of rational material was never achieved, demon- 
strates the tremendous resistance to the extinction of the delusional verbalizations. 


Taste 1. Minutes oF Rationat SpeecH Emrrrep Durine ALTERNATING 
10-Mrinvuts PEriops oF MINIMAL AND MAXIMAL REINFORCEMENT 


10-Minute Periods E2 


1st Session 


2nd Session 


3rd Session 


4th Session 


*Represents minimal reinforcement periods. 


The results of Experiment II are presented in Table 1. It is obvious that, 
initially, rational material was expressed at a very low level to the new experimenter 
(E 2), during periods of minimal reinforcement. Apparent also, is the fact that E 2 
very quickly elicited a high percentage of rational speech during the maximal re- 


inforcement period. The acquisition of rational verbalizations during periods of 
minimal reinforcement was less rapid. Table 1 also shows the second phase of 
Experiment II in which another experimenter (Z 3) participated. It is apparent that 
both phases of Experiment II show similar trends. There is a suggestion that the 
dependent variable was expressed at a lower level to E 3 as compared to EZ 2 during 
the minimal reinforcement period. 


Discussion 

The results of Experiment | strongly suggest that in a structured psycho- 
therapeutic relationship a selected class of verbal behavior can be modified through 
reinforcement techniques. It is further indicated, however, that this modification 
may be quite tenuous, existing only under a high frequency of reinforcement. At- 
tempts to place the dependent variable on a lower frequency of reinforcement and 
thus build a habit which would show substantial resistance to extinction resulted 
in rational verbalizations dropping to a very low level. 

At the time Experiment II was initiated it had become a simple matter to 
manipulate the dependent variable through elicitation and reinforcement. It was 
then demonstrated that the patient could be conditioned to increasing amounts of 
rational speech during reinforcement periods over a series of alternating periods of 
minimal and maximal reinforcement. The fact that the S tended to talk more 
rationally to E 2 than to £ 3 can be viewed in different ways. Perhaps E 2 possessed 
more reinforcement value since he had previously seen the patient in a therapeutic 
relationship. On the other hand, it might simply be that Z 2 could more easily 
initiate topics which would lead to rational speech since he had greater knowledge of 
the patient. Tine S’s responses to both Hs in Experiment II clearly show a progressive 
increment in the amount of rational speech for the 10 minute periods. In addition, 
learning in the maximal reinforcement periods apparently generalized to the minimal 
reinforcement periods. It should be noted that when £3 began working with the S 
the dependent variable dropped 90 per cent. Apparently the learned habit of 


E3 

1* 1* 

9 8 

8* 1* 
9 10* 

5° 10 
9 5° 

9* 10 
10 5* 

9° 10 

10 8* 
10* 10 
10 9* 
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rational responding to Z 2 was disrupted by the radical ‘cue change” of a new 
experimenter. 

A high level of rational speech was obtained in this patient through elicitation 
and reinforcement techniques. It is apparent, however, that the newly established 
behavior showed little resistance to extinction in either experiment. This is not too 
surprising since the incompatible habit, verbalization of delusional material, had 
been reinforced over a long period of time and under a wide range of stimulus condi- 
tions. Various other attempts could be made to build in the desired verbal behavior. 
For example, a series of experimenters could repeat the procedure followed by E 2 
and £ 3 in the expectation that generalization of the rational verbalizations to the 
stimulus class “other people’ would eventually be enhanced. 


SUMMARY 

Rational verbalizations in a neuropsychiatric patient were chosen at the class 
of behavior to reinforce while an effort was made not to reinforce the incompatible 
class, delusional material. E 1 conditioned the dependent variable, rational speech, 
to a high level of occurrence under a high frequency of reinforcement, but the condi- 
tioned response dropped sharply when an attempt was made to lower the frequency 
of reinforcement. EH 2 exposed the same S to alternating ten minute periods of 
“minimal” and “maximal” reinforcement and demonstrated conditioning of the 
same dependent variable. E 3 replicated the procedure which £ 2 had followed and 
obtained similar results. 
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CONDITIONING VERBAL BEHAVIOR OF SCHIZOPHRENICS IN A 
GROUP THERAPY-LIKE SITUATION! 


MICHAEL DINOFF, ROBERT F. HORNER, B. 8. KURPIEWSKI AND EDWIN O. TIMMONS 
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INTRODUCTION 


It has been demonstrated repeatedly that what one person says or does can in- 
fluence the verbal behavior of another in an orderly, predictable fashion. Krasner “? 
and Salzinger “ have written excellent reviews of verbal conditioning which has been 
shown to occur over a wide range of conditions with a number of experimental pop- 
ulations. Recently, many investigators have come to view psychotherapy as a 
complex form of this conditioning process, but to our knowledge there has been no 
attempt to demonstrate the phenomenon in a group situation. It seemed to us that 
group psychotherapy could be conceived of as a more complex form of verbal condi- 
tioning. 

Cohen“? suggests that a schizophrenic population does not condition readily 
because they ‘‘are deficient in responsiveness to acquired, social reinforcers.’”” He 
continues that schizophrenics should show relatively little learning in a situation 


1The authors would like to express their appreciation to Dr. William B. Robinson for providing 
the necessary subjects and to Dr. H. C. Ri or his generous assistance throughout this project. 
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involving verbal reinforcement. Although he reports that “negative” reinforcers 
clearly result in conditioning, data is also presented indicating that positive reinforc- 
ers produced no learning in a schizophrenic group of subjects. Conversely, Hartman 
®), Krasner and Ullman ®?, and Salzinger and Pisoni“ have demonstrated verbal 
conditioning with schizophrenics. 

The present research was designed (a) to demonstrate verbal conditioning and 
the permanence of its effect in a group therapy-like situation using eliciting and 
reinforcing techniques; (b) to show that these techniques are applicable to a schizo- 
phrenic population; and (c) to establish an easily scorable and reliable index of 
verbal behavior. 


MeEtTHOD 


Subjects. Ten male hospitalized chronic schizophrenic veterans were the sub- 
jects. All were between the ages of 30 and 47, median 39.5 years. All had prior 
hospitalizations, and none were in any form of psychotherapy. No major changes in 
hospital treatment occurred during the experiment. 


Experimental procedure. All subjects were seen in a group situation for 50 
minutes on each of three days to secure their initial response level. At the start of the 
first session instructions were read which lead the subjects to believe that the experi- 
menters were primarily interested in learning more about the interests of patients on 
their ward. 

During the initial three sessions two judges sat behind a one-way screen, ob- 
served the subjects, and scored each verbalization of the subjects as being one of five 
response classes (Personal, Therapist, Environmental, Group, and Ambiguous) on 
the basis of the following instructions revised from a prior study ®?: 

We want you to score these responses statement by statement as being either (P), (T), (E), 
(G), or (A) in accordance with the following definitions: 

1. Score (P) when a statement includes any reference to one’s self (P) or in connection with 
the environment (PE). 

2. Score (T) when a statement includes any reference to the examiner alone (T) or in con- 
nection with the speaker (PT) or in connection with the environment (TE). 

3. Score (E) when a statement does not include (P), (T), or (G). 

4. Score (G) when a reference is made about the group, or at least any two members of 
the group including (P) and at least one other group member present. 

5. Score (A) when a statement other than the above occurs or a statement is too ambiguous 
to score accurately. 


The subjects were divided into two matched groups of five subjects each. Group 
I was seen for six 50-minute sessions by an examiner. During this period, the examin- 
er elicited with questions and reinforced by his approval P responses five times from 
each subject each hour. In addition, he verbally reinforced any spontaneous P 
remark. Group II was similarly treated to maximize G references. 

After the conditioning series, subjects were then returned to the combined 
group situation, without the examiner, and again were asked to speak about what- 
ever they chose. These last three 50-minute sessions were scored by the same two 
judges. Immediately following the last ‘‘extinction’’ session, several questions to 
investigate ‘‘awareness’”’ were asked. 


Counter-Conditioning. After the “extinction” phase, the subjects were re- 
divided into the same groups. However, the group previously reinforced for P 
responses (Group I) was now reinforced for G responses during five 50-minute 
sessions; Group II was now reinforced for P responses. In the counter-conditioning 
phase, a second examiner was used in order to increase the generality of findings. A 
subject from Group I was lost due to elopement. After the second conditioning 
phase, the subjects met in a combined group situation for five additional 50-minute 
sessions with the judges scoring their verbalizations. Again “awareness” was in- 
vestigated. 
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RESULTS 


The degree of agreement between the judges was determined by correlating 
their scores on E and P responses for each operant and extinction day. Since no 
response was ever scored as A, this category was eliminated. The other categories, 
T and G, occurred too infrequently to allow application of meaningful correlational 
techniques. Every correlation obtained ranged between .84 and .99, with 17 of the 
22 correlations being .95 or higher. Even the lowest of the correlations is significant 
at the .005 level. 

Table 1 presents the operant level of responding during the first three sessions. 
Each of the response classes are relatively stable, although there is considerable sub- 
ject variability from session to session. We predicted that the rate of responding 


TaBLe 1. AVERAGE FREQUENCY AND PERCENTAGES (IN PARENTHESES) OF RESPONDING For ALL 
Supsects DuRING THE OPERANT SESSIONS 


Day E T 

1 16.9 (64.3 7.9 (30.0 0.5 (1.9 1.0 (3.8 26.3 
2 15.9 (59.1 9.5 i 0.8 0 0.7 (2.6 26.9 
3 19.3 (68.7 8.3 (20.5 0.0 0 0.5 (1.8 28.1 
Average 17.4 (64.2) 8.6 (31.7) 0.4 (1.5) 0.7 (2.6) 27.1 


would increase for all subjects since, on a partial reinforcement basis, the subjects 
were being reinforced for responding. When both experiments were combined, a 
matched-pair t-test yielded a probability of .07 for total response increase. 

When the initial operant level was compared with the total extinction data, no 
highly significant differences emerged in either experimental group. However, a 
general directional effect appeared. In the initial conditioning, the P-reinforced sub- 
jects showed an increase in P responses that approached significance, which was 
particularly apparent when these subjects were compared to the G group. Four out 
of five subjects in the P group increased in percentage of P responses in the condi- 
tioning experiment, while four out of five subjects in the G-reinforced group de- 
creased in P responses. 

Upon inspection of the data, it seemed possible that the experimental effect 
was but a short-lived one. Each subject’s operant response level, in both experi- 
ments was compared to his performance during the first day of extinction. In the 
conditioning experiment four out of five P-reinforced subjects increased in percent- 
age of P responding, and in the counter-conditioning, five out of five increased 
similarly. When the data were combined they yielded a probability of .01 on the 
basis of the binomial expansion. This effect did not appear in the G-reinforced 


group. 

Further inspection of the data led us to observe in both phases of the experiment 
that G-reinforced subjects increased in their production of E responses (binomial 
expansion p= .09) while P-reinforced subjects did not show this trend. This sug- 
gests that the experimenters may have reinforced some verbalizations other than 
those which the judges scored as G. A third judge, equipped with a more refined 
definition of G responses and using only taped recordings, found a more marked 
increase in G responses post hoc. 

On both questionnaires designed to test “awareness”, subjects reported either 
that the meetings were for them to get to know each other better or for the psy- 
chologists to learn more about patients in general. They all reported “enjoying” 
the sessions and several felt that they “improved” because of the meetings. 


Discussion 

While the effect of verbal conditioning did not result in long-lasting change in 
verbal behavior, the experimental procedure did lead to brief but significant change 
in a group situation with schizophrenics. This suggests that the negative findings 
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reported in some of the verbal conditioning experiments“? may have resulted be- 
cause the effect was hidden due to its very brevity. It does not follow that because 
schizophrenics are poor “conditioners”, or perhaps more correctly fast ‘extin- 
guishers”, methods cannot be found to prolong the effect. Another experiment is 
now in progress which should illustrate the effect of verbal conditioning and point 
up its effect in a generalized situation as well. 

Only minimal attempts were made to investigate the nebulous concept of 
“awareness.” Our findings imply that, in general, the subjects were unaware of the 
experimental procedure since they did not verbalize knowledge. One of the criti- 
cisms of research in verbal conditioning has been the lack of applicability or import 
of the behavior conditioned. We have extended the use of an easily scored, yet 
highly reliable, method of defining verbal behavior which has particular reference 
to the psychotherapeutic situation. More basically, we feel that one of the most 
important findings has been the establishment of a relatively stable method of in- 
vestigating and categorizing an aspect of operant verbal behavior in a group sit- 
uation. 


SUMMARY 

In group psychotherapy the assumption is: made that the same laws of verbal 
conditioning are involved as have been shown to apply in individual psychotherapy. 
Using a newly-developed, easily scored and highly reliable scoring system, in- 
dividual verbal response levels were established in a group situation with a schizo- 
phrenic population. Elicitation and verbal reinforcement were employed to condi- 
tion and then counter-condition either personal or group references. Trends emerged 
in keeping with predictions. Of greatest import is the method of measuring the in- 
itial verbal behavior as well as assessing the resulting dependent variable. 
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Tuscaloosa, Alabama 


INTRODUCTION 


A recent experiment has demonstrated group verbal conditioning with a 
schizophrenic population”. In that study, after an operant rate of verbalization 
was established, ten Ss were divided into two equal groups and reinforced for making 
a selected class of response. The test of learning occurred when the Ss were reunited, 
creating a stimulus situation quite different from the group settings in which condi- 
tioning took place. Since the only test of learning occurred under altered stimulus 
conditions, it could be argued that generalization rather than actual strength of 
_ learning was tested. The purpose of the present experiment was to employ a measure 
of learning during the conditioning process in addition to the investigation of the 
generalization effect. 


METHOD 


Subjects. The Ss were twelve male hospitalized veterans diagnosed as schizo- 
phrenic, between the ages of 34 and 64, with a median age of 41 years. None was in 
any form of psychotherapy and no major treatment changes occurred during the 
experimental period. 


Experimental Procedure. Before and after conditioning all 12 subjects met as a 
group without an experimenter present. Each of these meetings lasted for 110 
minutes, or two “50 minute’ hours. A 5 minute break occurred between each 50 
minute period. The Ss were instrucied to discuss whatever they chose and told 
that they were to be observed from behind a one-way screen. Two judges sat behind 
the one-way screen and made independent judgments as to the subjects’ responses. 
These raters placed each unit of verbalization which was judged to contain a com- 
plete thought in the following four response classes: Personal (P), Therapist (T), 
Environmental (E), and Group (G) — on the basis of the following instructions re- 
vised from a prior experiment“: 

“The sessions you are to score will contain several classes of verbal responses. We want you 
to score each subject’s verbalizations, statement by statement, as being either (P), (T), (E), or 
(G), using the following definitions: 

1. Score (T) when a statement includes any reference to the examiner alone or in connection 

with the speaker (PT) or in connection with the environment (TE). 


2. Score (P) when a statement includes any reference to one’s self (P) or one’s self in connection 
with the environment (PE). 


3. Score (G) when a reference is made about the group, at least any two members of the group, 
or (P) and at least one other group member present, or reference to patients in general. 
4. Score (E) when a statement does not include (P), (T), or (G).” 


Conditioning. The Ss were divided into two matched groups of six Ss each on 
the basis of their response levels during the two hour pre-experimental session. 
Group I was seen for eight 50 minute sessions by one of three examiners who were 
assigned to the group at random. During these sessions the examiner elicited by 
questions and reinforced by attention, reflection, and approval. In addition, he 
verbally reinforced any spontaneous remark containing a P referent. If a direct but 
unrelated question were asked of the examiner, he answered briefly with as little 
affect and as few words as possible. Group II was treated similarly but with G 
rather than P responses selected as the category to be reinforced. Immediately after 
each 50 minute session the E left the room. The Ss were then instructed to continue 


1The authors are grateful for the subjects provided by Dr. Mary Smith. Appreciation should 
also be expressed to all the members of the Psychology Department at the Tuscaloosa V. A. Hospital 
for their support and assistance. 
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their discussion for another 50 minutes. During this jatter session one or the other 
of the two trained judges observed and scored all of the responses so that an immed- 
iate measure of learning was obtained. 


REsULTs AND Discussion 

The agreement between the judges on Z and P responses during the pre-and- 
post-experiment two-hour sessions ranged between .72 and .99 with only two of 
eight correlations falling below .93. It seems safe to conclude that the judges were in 
essential agreement. Table 1 presents the pre-experimental rate of responding. It 
should be noted that each of the response classes exhibits minimal variability which 
is strikingly in keeping with prior findings“. 
TaBLE 1. AVERAGE hy enn AND PERCENTAGE (IN PARENTHESES) OF RESPONDING FOR ALL 

Sussects DuRING THE PRE-EXPERIMENTAL SESSION 


P G Total 


First hour 16.0 (50.7 6.1 (19.1 0.1 a3 2.1 iF .8) 24.3 
Second hour 13.8 (53.1 8.6 (19.8 4.8 (10.4) 27.2 


Average 14.9 (51.9) 7.4 (19.5) 0.3 (0.4) 3.5 (7.6) 25.75 


Conditioning. The data were combined into two-day units. A matched pair 
t-test between the first and last two-day units in terms of percentage of G for G 
reinforced Ss is significant beyond the .025 level of confidence. The increase in terms 
of frequency for this group was also significant beyond the .025 level on a comparison 


- of the first two to the last two sessions. 


For the P reinforced group a comparison on the percentage measure for the 
first and last two-day units yields a difference significant beyond the .025 level. This 
is not substantiated when the measure, increase in frequency, is examined. One 
subject decreased markedly in frequency of P-responses, cancelling out the moderate 
but consistent increase in frequency for the remaining five subjects. 


Generalization. The test of generalization occurred when the two groups of six 
Ss, which had been conditioned to different response categories (P or G), were re- 
united as in the original pre-test sessions. On both frequency and percentage in- 
crease, the G reinforced category approached acceptable confidence levels although 
neither measure is significant beyond the 5% level. No change on either measure is 
evidenced in the P-reinforced group. However, it should be noted that half of the 
P-reinforced subjects did not respond at all during the post-experimental sessions. 
The Ss who did respond showed an increment in the P category. 

It is very apparent that learning has occured in a group situation to a marked 
degree. It is also obvious that there was little generalization across situations. This 
finding is not particularly startling since it has often been subjectively noted that 
schizophrenics derive little lasting benefit from psychotherapy, even though they 
seem to respond within the therapy situation. Perhaps the logical ‘‘next steps’’ for 
research would be to investigate methods which would encourage appropriate and 
lasting generalization. 


SUMMARY 


Response categories were reliably measured in a group therapy-like situation 
with twelve schizophrenic Ss. The group was then divided and verbal elicitation and 
reinforcement were employed to increase either personal or group responses. Sig- 
nificant gains were obtained in both categories during conditioning, although the 
effect failed to generalize to any marked degree to another situation. 
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AN EXPLORATION OF PERFORMANCE MEASURES 
OF PERSONALITY! 


SEBASTIAN SANTOSTEFANO 
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PROBLEM 

In current personality theory there appears to be a trend away from emphasizing 
the so-called id or fantasy behavior of patients, with greater emphasis being given 
to instrumental or ego behavior, i.e., the behavior which patients may be expected 
to show in real situations of daily living. This shift has not yet been paralleled by 
corresponding changes in the philosophy and techniques of psychological testing. 
Situational judgments are usually made from tests which elicit primarily verbal 
responses to stimuli such as pictures. As Cronbach®? states, assessment today in- 
volves “hazardous” inferences, and he suggests that to improve prediction, tests 
should be used which duplicate the general factors underlying the day to day de- 
mands of the criterion situation. 

Many studies have investigated various forms of behavior, other than verbal, 
for personality assessment. During the 1930’s, studies of ‘expressive behavior’ 
received much attention especially from Werner Wolff®. Since World War II, two 
other approaches to the problem of performance testing of personality have appeared, 
one represented in the use of situational tests introduced by the OSS: *®), another 
in the work of Cattell®’. However, a performance test of personality which is 
feasible for use in ordinary clinic or research settings has not yet been produced. This 
investigator believes that performance testing of personality has potential values 
not yet fully realized. There is need for a testing method which (a) provides situa- 
tions requiring patients to cope with people and objects and encourages the free 
expression of personality attributes; (b) produces responses easily and unequivocally 
identified; (c) yields measures the meanings of which are subject to substantial 
consensual validation; and (d) is feasible for use in the usual clinic and research 
setting. 

The tests devised for this study were designed to evoke a choice of “coping” 
behavior“: 7), characterized by the S’s physically acting upon, manipulating or 
avoiding the objects in his environment. The individual responds to actual, physical 
stimuli of some potential significance to him rather than to conceptual or verbally 
defined stimuli. In devising the tests it was assumed that a person enters a situation 
with relatively stable tendencies to perceive and to respond selectively, endowing 
the stimuli with positive, negative, or neutral incentive values according to his 
unique personality characteristics. It was also assumed that personality character- 
istics are revealed, in some respect, by the stimulus materials he values or rejects 
and through the kinds of changes he physically imposes on these objects. 


METHOD 

Tasks were devised which required Ss to act upon stimulus material and which 
satisfied the following criteria: (a) the stimulus material should be administered on 
the top of a desk; (b) only a single act, which could be executed within a few seconds. 
should be sufficient to complete each task; (c) the meaning of a task should be agreed 
upon by competent judges; (d) the nature of a task should reduce the likelihood that 
the significance of the act performed would be apparent to S. 

The tasks were combined into pairs (Table 1), constituting miniature situational 
tests. Tasks were paired which required about the same amount of effort to com- 
plete, and also to insure that as wide a variety as possible of combinations of aspects 
of personality would be represented in the situations. The battery of tests employed 
in this study consisted of 41 situations or pairs of tasks.” 

tion by the author completed at Pennsylvania State 
Dor of the test end comprising the total test see (*), 
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Subjects. One hundred fifty male, high school students were selected randomly from 
the junior and senior classes of two high schools. 1t seemed desirable to investigate 
the tests devised using a fairly heterogeneous group in terms of abilities, interests, 
and personality. The Ss’s 1Q’s ranged from 76 to 135 and the various curricula in 
which they were enrolled included college, shop, agriculture, and commercial. Only 
males were tested to eliminate sex differences in response. 


Procedure. Each S was tested individually and asked to choose and perform one of 
each of the two ‘‘games”’ placed before him. He was urged to make his choices quick- 
ly and encouraged to make use of the feelings he experienced when confronted with 
any pair of tasks. To control for position preferences, the pairs of tasks were placed 
before one half of the group in specified left-right positions, and reversed for the 
second half of the group. The instructions preceeding each situation were reversed 
accordingly. 

The Ss’s choices in the situations were intercorrelated by setting up a 2 X 2 
contingency table for each of the possible combinations of pairs of choices, and 
computing for each an adjusted phi coefficient“. From, the matrix of intercorrela- 
tions, the 20 situations (Table 1) showing the most promise for defining common 
factors were selected and factored by Thurstone’s Centroid Method. 


Tasie 1. Parrs or Acts in Eacu oF THE TWENTY MINIATURE SITUATIONS SELECTED FOR A 
Factor ANALYSIS 


Tasks 


Examining with hand interior of a box 
Looking at a concealed picture 
Driving in a spike 
as out a spike 

nelipping paper 
Clipping paper together 
Tearing paper 

pairing paper 

Breaking a light bulb 

ribbling on paper 
Erasing a line 


Playing an easy unknown game 
Playing a hard unknown game 
Striking a sheet of glass 
Refusing to strike 

§ listening to Z’s heart 

E listening to S’s heart 
Scattering sawdust 

Sweeping sawdust 


1 
2 
3 
4 
5 
6 
7 
8 
9 


i=) 


RESULTS 
Six factors were extracted (Table 2) five of which seemed definable. The magni- 
tude of the cosines indicated no appreciable correlations among the five factors. 
Loadings of .40 or greater are given primary consideration in the interpretations. 
For each factor, the choice of the upper act of a situation tends to go with the choice 
of the upper act of each of the other situations in the same list; similarly, the choices 
of the lower acts go together. 


Situations and acts defining Factor I: ‘ 
Loading 
16. Breaking a large light bulb .80 
Breaking a small light bulb 
5. Breaking a light bulb 
Watering a plant 


Numbers m7 Numbers Tasks 
tieing E’s hands 
S gloving E’s hands 
12 Looking at a painting 
13 Reciting speech 
iting a 
Copying a speech 
4S E sorting cards 
S sorting cards for E 
15 S handcuffing 
E handcuffing S 
16 Breaking a large light bulb 
Breaking a small light bulb 
17 Being 
ing 
18 S reading to 
E reading to S 
19 Breaking a baloon 
Examining an unfamiliar object 
20 S shocking E 
E shocking S 
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TasLe 2. Roratep Factor Matrix 


| 


| 


—04 
29 
ll 


Note: All values above involve two decimal places. 


2. Driving in a spike 
Pulling out a spike 

8. Striking a sheet of glass 43 
Refusing to strike glass 

1. Examining with hand interior of box 37 


Looking at a concealed picture 


The acts contained by the upper members of the first four situations defining Factor 
1 represent overt, forceful aggression, while the group formed by the lower members 
suggests an avoidance of destructive acts. This avoidance is most clearly revealed 
by situation 8 where there is an out-right refusal to manifest an aggressive act when 
the opportunity to refuse is offered. Factor 1 seems to discriminate persons who 
prefer to aggress overtly and forcefully from those who are more inhibited and more 
reluctant to exhibit forceful aggression. 


Situations and acts defining Factor II: 


7. Playing an easy unknown game 
Playing a hard unknown game 
20. S shocking 
E shocking S 
19. Breaking a balloon 
Examining an unfamiliar object 
1. Looking at a concealed picture 
Examining with hand interior of box 


In interpreting the very high loading of situation 7, it is important to note that here 
S chooses between playing the “easy part” of a game and the “hard part” not 
knowing the nature of, nor as of, the task he is to perform. In situation 
20, it seems that the choice, S shocks £, is an avoidance of physical pain and fear 
rather than an act of aggression because of the situations in this cluster and the 
absence of other situations containing aggressive acts against EZ. It is also note- 
worthy that Ss who placed their hands in the box (situation 1) as well as Ss who 
avoided this choice, indicated they expected to find, for example, ‘‘a mouse trap’”’ or 
“broken glass.’’ Factor II seems to differentiate persons who avoid unfamiliar or 


Il III IV VI 
—34 16 —12 03 06 
—05 32 10 17 06 
12 —03 43 23 —32 
01 —12 51 —02 oo 
—06 04 20 —04 —18 
11 ll 49 31 04 
1,00 —03 00 16 —03 
05 —14 12 —02 —16 
—25 25 —09 ll —23 
10 - 00 09 63 61 03 
ll 00 —03 47 —20 —19 
12 13 23 29 —13 18 
13 07 64 23 00 —09 
14 30 —10 —09 47 05 
15 06 01 14 43 01 
16 04 05 03 02 —04 
17 —29 —17 —13 —20 —13 
18 —04 66 —09 01 23 
19 38 00 31 —09 33 
20 50 00 19 12 —30 
Loading 
1.00 
50 
38 
34 
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unknown stimuli which are viewed by them as potentially injurious, threatening, 
and difficult to handle from those who approach these stimuli. 


Situations and acts defining Factor III: 


18. S reading to 
E reading to S 

13. Reciting a speech .64 
Copying a speech 


In interpreting this factor it is necessary to consider the instructions of situation 13. 
Here E said, “‘A line from a speech is typed in this folder. Either read the line aloud, 
as if you were giving a speech, or copy it on this sheet of paper.’”” Many Ss who 
chose to read aloud, read the lines forcefully, and sometimes even histrionically. 
The upper members of the situations defining Factor III suggest a displaying of the 
self in conspicuous, solo, verbal behavior; the lower members suggest a preference 
for passive, inconspicuous behavior. 


Situations and acts defining Factor IV: 


Loading 
66 


Loading 
.63 


10. Scattering sawdust 
Sweeping sawdust 
4. Tearing paper 
Repairing paper 
6. Scribbling on paper 
Erasing a line 
11. S tieing E’s hands 
S Gloving E’s hands 
3. Unclipping paper 
Clipping paper together 


With the exception of situation 11, the upper members of Factor IV contain sym- 
bolic acts of aggression as contrasted with the overt, forceful aggression represented 
in Factor I. The behavior of the lower members uniformly represents order and 
neatness. 


Situations and acts defining Factor V: 
Loading 
10. Scattering sawdust 61 
Sweeping sawdust 
14. S timing £ sorting cards AT 
S sorting cards for E 
15. S handcuffing 43 
E handcuffing S 


Because situation 10 also contributes to Factor IV, the description of Factor V is 
considered to be tentative. By timing and hand-cuffing F (situations 14 and 15), S 
seems fo exercise some degree of control over EZ, while the alternates suggest com- 
pliance and acceptance of control exercised by others. 

Factor VI is defined by situations with low loadings and does not seem to have 
any particular psychological significance. 


Discussion 


These results are promising for the development of a performance test of person- 
ality offering some unique advantages. In addition to yielding a profile of choices 
which have been acted-out, the method should enable an examiner to observe a 
personality in action including characteristic and unique ways of approaching, 
avoiding and handling actual stimulus material and situations. Whether these acts 
represent ego functioning and how they are related to more unconscious responses 
remains for further investigation. 


51 
49 
A7 
43 
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It is recognized that the method places some unknown degree of restriction on 
the freedom of expression allowed an individual. The use of brief, single acts has 
advantages in administration but may preclude the appearance of certain character- 
istics of behavior which require continuity for expression. Further restriction is im- 
posed by the symbolic quality of the tasks which is necessary to mask some of the 
significance of the act chosen. 


SuMMARY 

The possibilities of assessing personality by means of performance tests were 
explored by devising tasks eliciting unitary, easily identified acts judged to have 
psychological meaning and which satisfied criteria of space and time. These tasks 
were paired to form miniature situations which were presented to Ss who chose and 
actually performed one of the two tasks in each situation. The Ss choices were inter- 
correlated and factored. The five factors obtained were labeled: (a) expressing overt 
forceful aggression vs. inhibiting overt forceful aggression; (b) avoiding unknown 
stimuli interpreted by Ss as potentially injurious and difficult vs. approaching these 
stimuli; (c) displaying the self in conspicuous, solo behavior vs. avoiding such display; 
(d) expressing symbolic aggression vs. expressing order and cleanliness; (e) exercising 
control over others vs. accepting the control of others. 


-~ 
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THE MARITAL ROLES INVENTORY AND THE MEASUREMENT 
OF MARITAL ADJUSTMENT 


NATHAN HURVITZ 

Buckingham Road 
Angeles, California 

PROBLEM | 


The present paper reports the use of a Marital Roles Inventory to measure 
marital adjustment. This Inventory is an indirect type of scale based upon the 
interaction of the marital roles of the spouses and which thus assesses marital adjust- 
ment rather than the personality traits associated with marital adjustment. 

The Marital Roles Inventory tests marital adjustment within the theoretical 
framework of role theory and is based upon the conedpt that the family as an institu- 
tion is an organization of roles. Within the family; the roles of the husband-and- 
father and wife-and-mother are units of conduct which stand out as regularities by 
their recurrence and form patterns of mutually oriented conduct). These roles, 
constituted into a role-set), have two aspects: each spouse’s performance of the 
roles in his own role-set, and each spouse’s expectation of how the other spouse will 
perform the roles in his role-set. This complementary and reciprocal relationship of 
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role performances and role expectations makes up the social structure of the family. 
One aspect of marital adjustment, therefore, results from the compatibility between 
role performances and role expectations. 


INSTRUMENT AND METHOD 

The Marital Roles Inventory lists in a random order and on separate pages the 
role-sets of the husband and wife; and each spouse is given a list of the roles in his 
own role-set as well as a list of the roles in his spouse’s role-set. Following are the 
husband’s roles in his own role-set which are presented to him to rank as his role 
performances. In brackets is the rewording of the husband’s role-set which is pre- 
sented to his wife to rank as her role expectations. 
I do my [He does his] jobs around the house. 
I am [He is] a companion to my [his] wife. 
I help [He helps] the children grow by being their friend, teacher, and guide. 
I earn [He earns] the living and support [supports] the family. 
I do my [He does his] wife’s work around the house if my [his] help is needed. 
I practice [He practices] the family religion or philosophy. 
I am [He is] a sexual partner to my [his] wife. 
I serve [He serves] as the model of men for my [his] children. 
I decide [He decides] when the family is still divided after discussing something. 
I represent and advance [He represents and advances] my [his] family in the community. 


Following are thé wife’s roles in her own role-set which are presented to her to 
rank as her role performances. In brackets is the rewording of the wife’s role-set 
which is presented to her husband to rank as his role expectations. 

I help [She helps] earn the living when my [her] husband needs my [her] help or when the 
amily needs more money. 

I practice [She practices] the family religion or philosophy. 

I care [She cares] for the children’s everyday needs. 

I am [She is] a companion to my [her] husband. 

I am [She is] the homemaker. 

I am [She is] a sexual partner to my [her] husband. 

I serve [She serves] as the model of women for my [her] children. 

I represent and advance [She represents and advances] my [her] family socially in and 
the community. 

T help [She helps] the children grow by being their friend, teacher, and guide. 


Two different sets of instructions are given for completing the Marital Roles 
Inventory. One set requires each subject to number the roles in his own role-set “‘in 
the order of importance in which you actually carry out your roles or functions in 
your family at the present time.”” This procedure gives the rank order of the sub- 
ject’s performance of his own roles. The second set requires each subject to number 
the roles in his spouse’s role-set ‘‘in the order of importance you want or prefer your 
wife (husband) to carry out her (his) roles or functions in your family at the present 
time.”” This procedure gives the rank order of the subject’s expectations of his 
spouse’s roles. One spouse’s rank order of the roles in his own role-set as he actually 
performs his roles can then be compared with his spouse’s rank order of the roles in 
the same role-set as the other spouse expects or prefers these roles to be performed. 
The difference between the rank order of the roles in the role-set of a husband or 
wife, ranked as role performances by one spouse and as role expectations by the 
other spouse, is the Index of Strain. 

The Index of Strain is therefore a measure of the difference between the rank 
orders that a.pair of spouses assign to the roles in a particular role-set in the Marital 
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Roles Inventory. The husband’s Index of Strain is the difference between the hus- 
band’s and wife’s ranking of the husband’s roles; and the wife’s Index of Strain is 
the difference between the husband’s and wife’s ranking of the wife’s roles. If a 
husband and wife pair assign the same rank order to the roles in the husband’s role- 
set or in the wife’s role-set it is assumed that there is no discrepancy between role 
performances and role expectations indicating marital adjustment. However, to the 
extent that a husband and wife pair assign different rank orders to the roles in the 
husband’s role-set or in the wife’s role-set it is assumed that there are discrepancies 
between role performances and role expectations indicating marital maladjustment. 

The Index of Strain is computed by taking the cube root of the sum of the 
cubes of the differences between the ranks the spouses assign to each role. With this 
method the range in the Index of Strain for both spouses is from .0 to 9.9. The Index 
of Strain was also computed by the summation of the differences between the ranks 
assigned to each role. The correlation between the Indexes of Strain derived by the 
cubing method and by the summation method has an r of .91 and is significant be- 
yond the .001 level. This suggests that the two methods of deriving the Index of 
— produce the same result and that the simpler summation method should be 

SAMPLE 

The Marital Roles Inventory and the Index of Strain were devised for and 
utilized in a study of marital roles and adjustment in marriage in a random sample 
of 104 middle-class couples“. The data were collected by the writer through inter- 
views with the subjects in their homes. Couples who agreed to participate in the 
study were interviewed together, and the husband and wife completed separate 
questionnaires simultaneously but independently in the presence of the writer. 

The mean age of the husbands in the sample is approximately 40 and of the 
wives is approximately 35; and the mean length of marriage to the present spouse is 
12.5 years. The modal family group is composed of a couple married 8 to 14 years 
with two children. Half the sample is Jewish; Protestants constitute more than a 
third; Catholics about ten per cent; and five per cent indicated no religious affiliation 
or preference. Forty per cent of the husbands and nearly 30 per cent of the wives are 
college graduates; one third of the husbands are professionals; another third are 
business grea or managers; and the mean income for these families was $9,615.00 
in 1957-58. 


RESULTS 


The mean Index of Strain for the husbands in the sample is 6.09 and for the 
wives 5.26 (t = 3.46, p < .001), indicating that the wives have significantly lower 
Indexes of Strain than their husbands. The wives have greater unanimity about the 
rank order of their marital roles and they perform their roles more in accord with 
their husbands’ expectations of them than the husbands perform their roles in accord 
with their wives’ expectations of them. 

The distribution of the husbands’ and wives’ Indexes of Strain has a significant 
positive correlation (r = .20, p < .05), indicating that there is a tendency for both 
spouses to have either high or low Indexes of Strain. Spouses who indicate agreement 
about the rank order of the roles in the role-set of one spouse also indicate agreement 
about the rank order of the roles in the role-set of the other spouse; while spouses 
who indicate disagreement about the rank order of the roles in the role-set of one 
> ae also indicate disagreement about the rank order of the roles in the role-set of 
the other spouse. 


Although serious questions“) have been raised about the concept of marital 
adjustment as measured by a marital adjustment scale, the Indexes of Strain of the 
104 husbands and wives were compared with their scores on a marital adjustment or 
happiness in marriage scale“). Significant inverse correlations were found between 
the husbands’ Indexes of Strain and the marital adjustment scores of the husbands 
(r = —.22, p < .05) and wives (r = .23, p <.05). Significant correlations were not 
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found between the wives’ Indexes of Strain and the marital adjustment scores of the 
husbands (r = —.07, p > .05) and wives (r = —.08, p > .05). Lack of strain be- 
tween the performance and expectation of the husbands’ roles is thus associated with 
marital adjustment as measured by a previously validated instrument. The fact 
that a similar association was not found for the wives may be explained by inad- 
equacies in both the Marital Roles Inventory and the marital adjustment scale, and 
by the special characteristics of the sample studied, rather than by a lack of associa- 
tion between the two variables. 

As a result of the present study it is suggested that the Marital Roles Inventory 
can be made more effective by adding “1 help manage the family income and 
finances,”’ to the role-sets of both spouses; and “I decide when the family is still 
divided after discussing something,’’ to the wife’s role-set. 


SUMMARY 

The present paper reported the use of a Marital Roles Inventory to measure 
marital adjustment. This Inventory is based upon role theory and the assumption 
that spouses whose rank order of role performances and role expectations are similar 
are adjusted in marriage while spouses whose rank order of role performances and 
role expectations are different are not, adjusted in marriage. The difference between 
the rank orders a pair of spouses assign to a role-set is the Index of Strain. 

Utilized with a middle-class sample the Marital Roles Inventory revealed that 
the wives have significantly lower Indexes of Strain than the husbands and that both 
spouses tend to have either high or low Indexes of Strain. Significant inverse associa- 
tions were found between the husbands’ Indexes of Strain and the marital adjustment 
scores of the husbands and wives; such associations were not found for the wives’ 
Indexes of Strain. Suggested modifications of the Marital Roles Inventory may 
result in more effective testing of marital adjustment by this method. 
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THE INTROVERSION-EXTRAVERSION FACTOR AND 
SOCIAL DESIRABILITY 


EDWARD F. GOCKA 
Veterans Administration Hospital, American Lake, Washington 


PROBLEM 

Edwards, Heathers, and Fordyce“? found that if an MMPI scale has a fair 
degree of balance present in its keying for socially desirable and undesirable re- 
sponses, it will correlate low with the MMPI Social Desirability (SD) scale. Further- 
more, if a scale were keyed primarily in the socially desirable direction, then it 
would correlate substantially in a positive fashion with SD; conversely, if keyed 
primarily in the socially undesirable direction, then it would have a large negative 
correlation with SD. These authors used an indirect approach for obtaining the 
socially desirable response direction of each item. Edwards®) and many others 
found that the probability of endorsement of an item in a personality inventory is 
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positively and highly correlated with the social desirability scale value of the item; 
hence, they argued logically that the most frequent self descriptive response of a 
large normative group was sufficient to establish the socially desirable response 
— of an item in most cases. This rationale will be maintained in the present 
study. 

Since, in the MMPI, an item which is keyed in the same direction as the 
majority response of the standardizing Minnesota normal groups is designated as an 
O item, while one keyed in the opposite direction as an X item, some direct pre- 
dictions of scale correlations with SD could be made on the basis of the X-O pro- 
portions in that scale. In fact, as expected, the above mentioned authors found high 
positive correlations between the SD scale and other scales having a substantial 
proportion of their items keyed in the O direction; conversely, large negative cor- 
relations were found for those scales having a large proportion of their items keyed 
in the X direction, and scales with a more equal balance between the proportions of 
X and O were found to have low or near zero correlations with SD. In verifying their 
expectations, the authors found a linear relationship between the SD correlations of 
the scales, and the proportion of X items in the scales; i.e., the product moment 
correlation between the SD correlations and the proportion of X items was —.96. 
The results were interpreted as indicating that if a scale has a fair degree of X-O 
balance, then scores for the scale will tend to have relatively low correlations with 
scores on the SD scale. However, as they pointed out, low correlations with the 
SD scale might also be obtained because the items in the scale are relatively neutral 
with respect to their social desirability scale values. 

This study investigated whether the correlation pattern of the Introversion- 
Extraversion constellation of MMPI scales, which have been found to correlate low 
with SD and also have been identified as a fairly independent second order factorial 
dimension “, could be entirely accounted for by X-O item balance. MMPI scales 
found to load heavily on this dimension and to overlap little with each other are: 
Welsh’s® second factor (R); Cuadra’s“ control (CN) factor; Block’s ® ego control 
(EC3) factor; Mees’ repressive-expressive (M2) factor; and Rozynko’s“ inter- 
nalization-externalization (IE3) scale. 


Meruop 


Since each of the Introvert-Extravert scales has approximately equal balance 
between X and O items, 100% X subscales and 100% O subscales could be formed. 
For example, the 50 item Cn scale could be broken down into a 29 item Cn-X sub- 
scale and a 21 item Cn-O subscale. Now if keying of the items was the sole determin- 
ant underlying the near-zero r’s with the SD scale, then we would expect the X sub- 
scales to have large negative correlations with SD and the O subscales to have large 
positive correlations with SD. The 10 resulting subscales from the Introversion- 
Extraversion constellation were correlated with the SD scale in order to determine 
the effects of X-O balance. 

Our Ss were 84 hospitalized male patients with a former diagnosis of schizo- 
phrenia but were scheduled for discharge at the time of testing. A replication of the 
Edwards et al.“ study was carried out first, using a number of the same scales. The 
results were essentially the same, indicating that the forthcoming findings could not 
be interpreted in terms of population differences. 


RESULTS AND DISCUSSION 


The SD correlations of the total Introversion-Extraversion scales as well as the 
X portion and O portion subscales are oe in Table 1. In general the X portion 
subscales correlate more negatively with SD than do the total scales, while the O 
portions correlate more positively. This indicates that X-O balance is reflected in 
the composition of the introversion-extraversion factor. The absolute sizes of the 
r’s, however, are not what was expected from the linear relationship between percent 
of X items and correlations with SD even if some attenuation were made for in- 
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TaBLe 1. CorrELaTIoNs oF MMPI InTROVERSION-EXTRAVERSION 
ScaLEs AND SuBSCALES wiTH SD (N = 84). 


SD vs. SD vs. SD vs. 
Scales Total Scale X Portion O Portion 


R -12 .34 
M2 -05 31 -23 
TE3 34 .52 


Cn — .24 46 -27 


creasing the test lengths. The results of Table 1 are presented in Figure 1, where the 
proportion of X items is plotted along the abcissa and the correlation with SD along 
the ordinate. A rough approximation to the slope expected from our initial analysis 
and from the Edwards’ et al, ) data is given in the dashed line. The diagram shows 
yy balance has some effect on the low r’s with SD, but that this is only a portion of 
the effect. 


Fic. 1. Between CorrRELA- 
TIons oF MMPI InrrRovERSION-EXTRAVERSION 
ScaLEs AND SusscaLes tHE SD Sca.e 
(ORDINATE) AND THE PROPORTION OF X ITEMS 
IN THE ScALEs (ABSCISSA). 


Correlations with SD 


It is interesting to note that R, M2, and EC3, showed least X-O balance effects 
and that in a previous factor study “, using the same subjects, they were the scales 
with the largest loadings on the Introversion-Extraversion dimension. Since EC3 
did not show any appreciable X-O balance effects, it was selected for comparison with 
the SD scale with regard to the overall neutrality of the items in each scale; that is, 
it was expected that the EC3 items would have a more equal balance, in general, 
between the proportion of people endorsing the item and the proportion not en- 
dorsing the item. That this was so was determined from the fact that 68% of the 
EC3 items have a response balance between 30-70%, as indicated by the Minnesota 
normal males, while only 23% of the items in the SD scale fall within this range. We 
may conclude that the EC3 items, therefore, are less extreme items. 


SuMMARY 


When MMPI scales loading on an Introversion-Extraversion dimension are 
divided into subscales made up totally of either X type items or O type items and 
correlated with the MMPI social desirability scale (SD), the hypothesis that X-O 
balance accounts entirely for a low correlation between a total scale and SD is not 
verified. The alternative conclusion is that the items are more neutral with regard 
to social desirability. That this seems to be the case holds on a gross level for the 
ego control (EC3) scale which has more neutral items in general than does the SD 
scale. It is apparent, however, that social desirability effects do enter, in a small 
way, into the formation of this factor by MMPI scales. These effects then, would 
introduce a small bias in a factor analysis by making the introversion-extraversion 
factor more independent of the SD factor than it probably is in nature. 
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THE ROLE OF NEED-ATTITUDES IN ADJUSTMENT 
JAY L. CHAMBERS AND LOUIS J. BROUSSARD 


Charles L. Mix Memorial Fund, Inc. 
Americus, Georgia 


PROBLEM 
A previous study“? investigated the attitudes of normal and paranoid schizo- 
phrenic men toward needs of the Murray ®? need system. For the present study, the 
need-attitudes of two new clinical groups were mesaured by the same technique 


previously described. Cluster analyses of the need-attitudes of the two groups of the 
previous study and the two groups of the present study were examined by a method 
designed to reveal quantitative similarities and differences between cluster configur- 
ations. It was expected that a systematic comparison of the need-attitude clusters 
of normal and maladjusted groups would provide indications of the role of need- 
attitudes in adjustment. 


METHOD 


Subjects. Normal subjects for the study were obtained from Americus and 
Albany, Georgia, from Lexington, Kentucky and from the rural areas surrounding 
these three communities. Clinical subjects were obtained from five different Vet- 
erans Administration and state mental hospitals and from several out-patient 
clinics located in different cities. The clinical patients were selected on the basis 
of a clear cut psychiatric diagnosis. The normal subjects had no history of mental 
disorder or severe emotional disturbance. 

Each of the four groups in the present study was composed of 100 adult males. 
The normal subjects had an average age of 34 and a mean education level of 12.4 
years. An alcoholic group had an average age of 39 and a mean education level of 
11.7 years. A paranoid schizophrenic group had an average age of 38 and a mean 
education level of 11.5 years. A chronic undifferentiated schizophrenic group had 
an average age of 35 and a mean education level of 11.2 years. The standard de- 
viations for age ranged from 7.8 to 10.8 years for the four groups while the standard 
deviations for education levels ranged from 2.3 to 2.9 years. 

Some significant differences in age and education occurred between groups. 
The most reliable mean difference in age (t > .01) was five years between normals 
and alcoholics and the most reliable mean difference in education (¢ > .01) was 1.2 
years between normals and undifferentiated schizophrenics. The possible effects of 
differences in age and/or education must be considered in interpreting the data 
although no effects attributable to these factors were readily discernible. 
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Picture Identification Test. Briefly, this test“) gives quantifiable measures of 
the frequency a S attributes need descriptions to his own selections of liked and 
disliked photographs of people. Consistent sorting of a need description with liked 
pictures was considered evidence of a positive attitude of the S toward that need. 
Consistent sorting of a need description with disliked photographs was taken as 
evidence of a negative attitude of the S toward the need. 8’s sorting scores for the 
21 Murray needs were ordered from most positive to most negative scores. The 
rank of a need was taken as an index of the relative strength of the 8’s attitude 
toward the need. Interpreting rank scores of the needs as indices of attitudes toward 
needs seemed more defensible than a claim that the needs per se were being measured. 

A measure of variability (from chance sorting) and a measure of consistency 
(tendency to sort more with liked than disliked photographs) were both computed 
for each subject. Two equivalent forms of the test were used. In each group, half 
of the subjects took one form and the other half took the alternate form. The 
combined data from both tests were used in the analyses. 


RESULTS 


Previous study indicated that a configural approach was the most fruitful for 
revealing the significance of the various need-attitudes for the different groups. 


1. Ciuster ANALysis oF oF 100 ALCOHOLIC MALES 
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Intercorrelations between all 21 need-attitude scores and the variability and con- 
sistency measures were computed for the alcoholic and for the chronic undiffer- 
entiated schizophrenic groups. The correlation matrix for each group represented 
— correlations (via the Z transformation) of two samples with 50 Ss per 
sample. 

The two matrices were cluster analyzed. Tables 1 and 2 present the cluster 
analyses for the alcoholic and chronic undifferentiated schizophrenic groups re- 
spectively. The procedure was an exact replica of the cluster analyses of need- 
attitude data for normal and paranoid schizophrenic Ss presented previously“. 
It was possible, therefore, to reexamine the previous data and compare the need- 
attitude clusters of each of three clinical groups with those of normals. 


TaBLe 2. Ciuster ANALYSIS OF NEED-ATTITUDES OF 100 UNDIFFERENTIATED 
ScHIzOPHRENIC MALES 


Need-Attitude 


Defendance 
Harmavoidance 
Autonomy 
Inferiority-avoidance 


Need-Attitude 


Affiliation 


Nurturance 
Achievement 
Order 


Understanding 


Deference 
Achievement 
Order 


Inferiority-avoidance 


Dominance 
Cluster 4 Abasement 


Succorance 
Cluster 5 Harmavoidance 
Exhibition 
Cluster 6 Aggression 
Exhibition 
Cluster 7 Aggression 


jon 


Cluster 8 Autonomy 


Succorance 


Cluster 9 


Rejection 


Cluster 10 


++] +] ++ 


Harmavoidance 
Cluster 11 Consistency 


A quantitative method was devised to show how each of the clinical groups had 
the same or different cluster combinations for the test variables as were found for 
normals. The method is illustrated as follows. First, each variable appearing in the 
alcoholics’ clusters was checked to see with which other variables it combined. For 
example, Harmavoidance, in alcoholic cluster 1, correlated with 10 other variables 
in that cluster and these were all recorded along with any new combinations ap- 
pearing in alcoholic clusters 2 and 8 where Harmavoidance was also present. The 


Avg. r 
Deference — .36 
Nurturance — .32 
Order — .30 
Cluster 1 = 
ggression nderstandi -. 
Succorance — .26 
ion -. 
y — .32 
—.31 
Autonomy ™ 
Cluster 3 Sex 
' 
Agg 


386 JAY L. CHAMBERS AND LOUIS J. BROUSSARD 


normal cluster correlations for Harmavoidance were likewise recorded and then the 
two sets of correlations were compared to see which correlations were shared and 
which were unshared by the two groups. Unshared correlations were classified 
according to whether normals had the combination and alcoholics lacked it or vice 


versa. All the other need-attitudes were analyzed the same way. Cluster combina- 
tions of each of the three clinical groups were compared with those of normals. 


Discussion 


Several questions and hypotheses were suggested by the data. First, why do 
some need-attitudes cluster with a number of others for a particular group while 
other need-attitudes show few or no cluster combinations? One possible answer is 
that the more widely a need-attitude is interrelated in the cluster a of a group, 
the more the group has prepared for the satisfaction, expression, inhibition, or chan- 
neling of the need. The combinations of a widely related need-attitude would thus 
represent a complex system devised to cope with problems associated with the need. 
If this proposition is valid, normals should be most concerned with such needs as 
Aggression, Autonomy, Exhibition, and Sex since they have the most cluster com- 
binations for these needs. Normals have the least number of cluster combinations for 
and should therefore be least concerned with needs for Counteraction, Inferiority- 
avoidance, Sentience, and Succorance")?. 

Second, it may be assumed that the need-attitude system of normals reflects 
more realistic aud mature solutions for adaptation to our culture than the systems 
developed by any of the clinical groups. The kinds of departures that clinical groups 
make from the normal pattern can thus be used as indicators of maladaptive at- 
tempts to cope with need problems. For therapeutic change, therefore, clinical 
groups should unlearn their own peculiar attitude patterns and learn the normal 
patterns they lack. 

If the above speculations are applied to the alcoholic clusters, data suggest that 
alcoholics require the most unlearning for Succorance and Inferiority-avoidance 
needs as they hold 14 and 11 need-attitude combinations for these needs which 
normals do not hold. Alcoholics have the most new learning to do for Blamavoid- 
ance and Play needs as they lack 9 and 8 normal attitude combinations for these 
needs. Lacking normal attitudes to enable them to enjoy ordinary relaxation 
(n Play) and lacking normal checks and controls from superego or blame avoid- 
ance attitudes, inebriates may find that alcohol artificially induces an acceptable 
and pleasurable escape from their ineffective efforts to cope with problems of de- 
pendency (n Succorance) and inadequacy (n Inferiority-avoidance). 

Similar analyses applied to the other clinical groups indicate that paranoid 
schizophrenics and chronic undifferentiated schizophrenics show similarities in their 
new learning and unlearning requirements. Both groups of schizophrenics have the 
most new learning requirements for Blamavoidance and Exhibition needs and both 
need to unlearn attitudes connected with Inferiority-avoidance. The most striking 
difference between the two schizophrenic groups is with regard to attitudes related 
to n Succorance. Chronic undifferentiated schizophrenics have devised a complex 
but abnormal system of attitudes to cope with Succorance needs (13 non-normal 
need-attitude combinations) whereas the paranoids show no special concern with 
this need (0 non-normal need-attitude combinations). 

Attending to the similarities between the two schizophrenic groups, one could 
rationalize that delusions of persecution may represent the schizophrenics’ primitive 
substitutes for normal attitudes controlling Blamavoidance needs. Delusions of 
grandeur might represent primitive substitutes for the lack of normal controlling 
attitudes for exhibitionistic needs. Autistic withdrawal may reflect the schizo- 
ao extensive but abnormal defense preparations to avoid exposure of in- 
eriority feelings. 

Theoretically, the most pathological need-attitude problems for any group 
would center around those needs showing the most deviations from normal stand- 
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ards based on both new learning and unlearning requirements. Accordingly, the 
three most troublesome needs for alcoholics were: Succorance (17 unshared need- 
attitude combinations), Harmavoidance (15), and Defendance (14). The three most 
troublesome needs for paranoid schizophrenics were: Aggression and Blamavoidance 
(14 unshared combinations), and Achievement (13). The four most troublesome 
needs for chronic undifferentiated schizophrenics were: Succorance (17 unshared 
combinations), Harmavoidance (15), Exhibition and Order (12 each). This view 
of the data shows that alcoholics and chronic undifferentiated schizophrenics were 
similarly oriented around problems of passivity and fearfulness (n Suc and n Har) 
while the paranoid’s problems were oriented more toward active striving (n Ach), 
hostility (n Agg), and guilt (n Bla). 


SUMMARY 


Cluster analyses of the need-attitudes of normal and paranoid schizophrenic 
males from a previous study and alcoholic and chronic undifferentiated schizo- 
phrenic males from the present study were examined by 2 method designed to reveal 
quantitative similarities and differences between cluster configurations. 

Some theoretical propositions as to the role of need-attitudes in adjustment 
were formulated as follows: (a) The more widely a need-attitude is interrelated with 
other attitudes in the cluster system of a group, the more extensive the preparations 
of the group for coping with problems associated with the need. (b) If the need- 
attitude cluster system of normals is accepted as the most desirable model, clinical 
groups should learn the normal patterns they lack and unlearn their peculiar need- 
attitude patterns. 

The data, interpreted according to the above hypotheses, suggested that, in 
order to conform to normal patterns, alcoholics a most change in attitudes 
associated with Succorance, Harmavoidance, and Defendance needs; paranoid 
schizophrenics required most change in attitude patterns related to Aggression, 
Blamavoidance, and Achievement needs; chronic undifferentiated schizophrenics 
required most change in attitude patterns related to Succorance, Harmavoidance, 
Exhibition, and Order needs. 
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DEPRECIATION AND THE SELF CONCEPT! 
MICHAEL MOSES AND REX DUVALL 
University of Texas 


The basic motivation according to Rogers’®) will tend to estimate their performance more 
self theory is the maintenance and enhancement 


of the self. Each individual develops a percep- 
tion of self as a result of direct experience and 
evaluation by others. When an ience oc- 
curs that is incongruent with the self-concept, it 
may be denied symbolization or given a di 

torted symbolization. Thus, individuals with a 
high discrepancy between and ideal concepts 
will tend to perceive experience in accord with 
that discrepancy. In the present study it is pre- 
dicted that persons with high self-ideal discrep- 
ancy will underestimate their performance on a 
task while persons with low self-ideal discrepancy 


accurately. 


Subjects. The Worchel “ Self-Activity Invento 
(SAI) was administered to 250 students 4 
Introductory Psychology classes at the Univer- 
sity of Texas. The SAI consists of 54 statements 
which describe responses to the arousal of 
hostility, achievement, sexual and dependency 


1The authors wish to thank Drs. Philip Worchel 
and Robert Young for their encouragement and 
aid in preparing this paper. 
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needs. The S is required to indicate on a 5-point 
scale, from 1 indicating never to 5 mceeing 
always, how much of the time the descri 
activity is like him (Self), and how much of the 
time it is like how he would like to be (Ideal). 
Self-ideal discrepancy scores for each S are ob- 
tained by summing the absolute discrepancy 
scores on each item. 

For the present study, 14 males and 10 females 
who had scores from among the highest 30 self- 
ideal discrepancy scores were chosen to serve as 
Ss in the high group and 13 males and 11 females 
who had scores from among the lowest 30 self- 
ideal discrepancy scores were chosen for the low 
group. 

Experimental ign. A modified “Humphries 
board’’ was utilized in a three-choice situation in 
which one light went on randomly 66 per cent of 
the time; one light, 33 per cent of the time; and 
the third, 0 per cent. Standard instructions 
adapted from Estes and Straughan “) were pre- 
sented to Ss, all being run individually. Ss were 
told to predict which of the three lights would A 
on following a “ready” signal by pressing 
corresponding key. 

Five random sequences of events in 42 trial 
blocks were used for both the high and the low 

ups, giving a total of 210 trials for each S. 
Back of the reinforcing events was cdunter- 
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matching. The obtained ¢’s of .26 for the low 
group and .50 for the high group do not approach 
ignificance at the .05 level, and indicate that Ss 
were event matching. The results in terms of 
mean number of correct guesses and mean 
estimated number of correct guesses for the low 
and for the high groups are given in Table 1. 

A t test was computed between the means of 
the correct guesses for the high group and those 
for the low group, resulting in a ¢ of .30, indicat- 
ing that difference in performance between the 
low and the high groups is not significant. A 
median test“) was applied to the estimates of 
number of correct guesses for the low and for the 
high groups and the obtained chi-square was 
found to equal 4.08 with one df, which is signi- 
ficant beyond the .05 level. 


SuMMARY 

It was predicted that persons with high self- 
ideal discrepancy would tend to depreciate their 
performance on a task, whereas persons with low 
self-ideal discrepancy would tend to estimate 
their performance more accurately. The Worchel 
SAI was used to establish self-ideal discrepancy 
and two groups of 24 Ss each, one group compos- 
ed of students having extremely high self-ideal 
discrepancy and one group com of students 
having extremely low self-i discrepancy, 


1. Mean Correct GuEsses AND Estrmmatep Correct Guesses For 
' Tue Two Groups oN A PROBABILITY LEARNING Task 


Correct guesses 


Group 


8D 


Estimated 
correct guesses 


8D 


High S-I 
Low S-I 


108 .25 
109.04 


8.89— 
9.00+ 


76.875 
95.58 


34.59+ 
30.25 — 


balanced over lights to control for — bias, 
which required six Ss for complete counter- 
balancing. Four replications of the counter- 
balanced orders were used, requiring a total of 
24 Ss in each group. 


Procedure. After instructions were read to each 
S, four practice trials were given and any ques- 
tions concerning the task were answered. The 
Ss were then given 210 trials, following which 
they were asked the following question: “There 
were 210 trials. How many do you think you got 
right? The number of times Ss pushed the 66 
per cent key, number of correct responses and 
estimated number of correct responses were 


Resvutts Discussion 
For the 66 per cent light, ¢ tests were run over 
the last 42 trials for both the high and the low 
groups in order to compare the guesses of the Ss 
with what would be expected if the Ss were event 


were run on a three-choice probability learning 
task. Difference in performance between the two 
groups was not significant, while difference in 
estimation of performance between the two 
groups was significant. It was found that Ss 
with high self-ideal discrepancy tended to de- 
preciate their performance, while Ss with low 
self-ideal discrepancy tended to estimate their 
performance more accurately. 
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A NOTE ON AUTHORITARIANISM AND ATTITUDES 
TOWARD PARENTS 


E. JERRY PHARES 
Kansas State University ~ 


PROBLEM 


This study attempts to relate the construct “anti-democratic” personality, as 
measured by a modification of the California F Scale, to specific responses derived 
independently of that scale to determine the construct validity of the F Scale. Since 
the publication of the Authoritarian Personality“, a plethora of F Scale research 
has resulted ®. However, there still appears to be need for study of the construct 
validity of the scale. For example, Hyman nand Sheatsley “? point out that the valid- 
ity of the original questionnaire data of Adorno et. al. is seriously impaired because 
the criterion (personal interview) was not independently derived. 

From an examination of the underlying theory of the anti-democratic person- 
ality certain differences between high F and low F Ss would be expected along the 
dimension of appraisal of parents. For example, high F Ss should idealize authority 
and be less critical of its negative aspects. They should evince convent’ al atti- 
tudes toward parents and an overestimation of parental virtues at the e.pense of 
parental shortcomings. The foregoing comments should characterize low F Ss to a 
much lesser degree. With the above considerations in mind, the following hypothesis 
was derived: When high and low F Ss are asked to list both positive and negative 
qualities of their fathers and mothers, low F Ss will list a higher proportion of nega- 
tive qualities than will high F Ss. 

ently, the F Scale has been scrutinized regarding its built-in tendency to 
elicit response set ®: * 4. 7, 9), For this study, a modified F scale was constructed 
consisting of 15 positively worded items (F +)! and 15 negatively worded or reversed 
items (F —)*. The F— items are a synthesis of the reversals used by Chapman and 
Campbell ®) and Jackson and Messick“. 


MeETHOD 


Thirty-one females and 33 males from one class in Exceptional Children and 
two classes in General Psychology were used as Ss. The procedure was carried out 
in the classroom situation in each case. 

Initially, all Ss filled out the modified F Scale. Next, Ss were told they were 
being asked to help standardize a new “‘speed of association’’ test. The instructions 
were: ‘You are going to be given five minutes in which to list, one after the other, as 
many negative characteristics of your father as you can. Then, after this, you will 
be given another five minutes in which to list as many negative characteristics of 
your mother as you can. And next, you will be asked to follow the same procedure in 
regard to positive characteristics of your father and mother, respectively.” 

E then clarified any questions and particularly emphasized that the information 
was being obtained anonymously and would in no way reflect upon the Ss. Sheets 
were numbered so that F Scale protocols could be matched with father-mother 
protocols; no names ap ed. At the end of five minutes Ss were stopped and the 
next five minute Salat Gale, mother) began. After this, a similar procedure was 
followed for positive characteristics. 

Two judges examined each father-mother protocol and determined, for each 
S, the total number of responses in each of the four categories. The judges were not 
aware of Ss’ F Scale scores. Their average agreement was 98%. 


1From Form 40, items 1, 9, 12, 13, 19, 21, 23, 25, 26, 27, 29, 35, 37, 38, and 44. 
*From Form 60, item 41; from Form 40, items 2, 4, 6, 8, 16, 18, 22, 31, 33, 34, 39, 41, 42, and 43. 
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TasBie 1. Summary or Moruer-Fatuer Data (N = 20 HIGH AND 20 Low F Ss) 


Mother 
High F Low F 


Mean F Score 133 
Negative responses 

Positive responses 210 
Total responses 258 
Proportion (Neg./Total) .19 


aEdwards’ “) method of comparing two proportions was used. 
>Based on one-tailed tests. 


RESULTS 

Ss responded to the modified F Scale items in any of six ways: strongly agree, 
moderately agree, slightly agree, slightly disagree, moderately disagree, and strongly 
disagree.* Both F+ and F— items were scored in the authoritarian direction. In 
testing the hypothesis, the 20 highest scoring Ss were compared with the 20 lowest 

. scoring Ss. Since the proportion of negative responses to total number of responses 
regarding parental qualities did not differ between male and female Ss, the data for 
the two sexes were pooled. 

The results shown in Table 1 confirm the hypothesis that low F Ss will list a 
higher proportion of negative parental qualities than will high F Ss. These results 
apply to attitudes toward both paternal and maternal qualities. Thus, these results 
aid in the further delineation of the properties of the anti-democratic personality as 
measured by a modification of the California F Scale. 


SuMMARY 


It was predicted from the theory of the Authoritarian Personality that low F 
Ss would list a higher proportion of negative parental qualities than would high F Ss. 
Ss were categorized as high or low F through a modified version of the F Scale which 
consisted of 15 traditional items and 15 worded in the reverse direction. The results 
confirmed the hypothesis at a statistically significant level and are consistent with 
the theory underlying the anti-democratic personality. 
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%Scores were assigned to each item, e. g. ., from strongly agree, +3, to strongly disagree, —3. A 
constant of 4 was added to each item’s score 
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THE VALIDITY OF CLINICAL RATINGS FOR ESTIMATING 
SEVERITY OF SCHIZOPHRENIA! 
WILLIAM A. HUNT, RONALD E. WALKER AND NELSON F. JONES 

Northwestern University Louisiana State University in New Orleans 


As part of an extensive investigation of the nature of clinical judgment“, we 
have studied the reliability of clinical judgment when judges are called upon to rate 
schizophrenic test responses using a seven-point scale: * *), The addition of an 
external criterion now makes it possible to study the validity of such ratings®. This 
is of practical as well as theoretical interest since it raises the possibility of using such 
ratings as “scores” for purposes of estimation and prediction. In the present study a 
criterion group of twelve schizophrenic patients differing in the severity of their dis- 
order was given the Vocabulary and Comprehension subtests from the Wechsler 
adult scale. The individual test responses were then independently rated by another 
group of clinicians using a seven-point scale. Ratings were obtained on four dimen- 
sions: how schizophrenic the response was, the potential intelligence exhibited ir- 
respective of the correctness of the response, communicability, and concreteness- 
abstractness. Using these ratings as raw scores and establishing a rank order based 
upon the mean value for each patient makes possible a comparison with the original 
criterion ranking for severity of disorder. Inter-comparison of the four dimensions 
also makes possible a replication of previous findings on the judged characteristics of 
schizophrenic thinking“. In addition, the data were processed for further informa- 
tion on the reliability of the judgments. 


METHOD 

Establishment of the Criterion. Twelve male schizophrenic:patients selected as cover- 
ing the complete range of severity of the disorder were each given an individual 
interview by a staff psychologist. The interviews were observed through a one-way 
screen by three other staff psychologists, serving as criterion judges. On the basis of 
the interview plus study of additional case and test materials these three judges 
independently rank ordered the patients for severity of schizophrenia. Individual 
differences between the judges were then ironed out in conference and a final single 
rank order unanimously agreed upon. A more detailed discussion of the procedure is 
available elsewhere 


Materials. Immediately following the interview each patient was individually given 
the Vocabulary and Comprehension subtests from the Wechsler scale by still another 
staff psychologist. The test results were then broken down and each response trans- 
cribed on an individual card. These cards were then sorted to randomize them. The 
total comprised 141 Comprehension, and 383 Vocabulary responses, 524 cards in all. 


Subjects. The subjects were four groups of experienced clinicians with five clinicians 
in each group. A separate group was used for each judgmental dimension. All the 
clinicians possessed the Ph.D. degree in clinical psychology and at least four years of 
professional experience. 


Procedure. The 524 cards were mailed to each judge and he was asked to rate each 
response for one of the dimensions, severity of schizophrenia, potential intelligence, 
communicability, and concreteness-abstractness, using a seven-point scale on which 
point one represented a minimal amount of the dimension in question, and point 
seven a maximal amount. The detailed instructions were those used in previous 
studies and may be found complete elsewhere ® *). Using the ratings given each 
patient’s responses by the judges as raw scores, mean values were obtained and used 
1This study is of a r project being con iversi 
search. Facilities for collecting the experimental materials and establishing the criterion were pro- 
vided by the VA Hospital, Hines, Illinois. The opinions expressed, however, are those of the rat ell 
and do not represent opinions or policy of the Naval service nor of the Veterans Administration. 
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to establish rank orders for the twelve patients under the various conditions. Rank 
order correlations between these orderings furnished the basis for the comparisons 
presented here. In establishing the inter-judge agreement which is offered as a 
measure of the reliability of the clinicians’ judgment, each clinician was run against 


the mean value of the other four judges in his particular group, thus providing a cor- 
rection for his influence on the group values. 


RESULTS 


Correlations were run between the rank order of the 12 patients based upon the 
mean ratings of amount of schizophrenic pathology present in their test responses 
and the rank order obtained in the external criterion. For Vocabulary the correlation 
was .57, and for Comprehension .69. The first just misses significance at the .05 level 
and the second attains it easily. Keeping in mind the small number of patients and 
the fact that only five judges rated them we may cautiously conclude that the 
judges’ ratings of severity of pathology are valid within the limitations of our cri- 
terion. This suggests that such ratings may be utilized as scores for estimating the 
severity of disorder present in schizophrenic patients. Since the original criterion 
order was shown to be related to subsequent patient progress, it is possible that 
such ratings may also have some predictive potential. The practical applications are 
obvious and certainly merit further investigation. 

Using the objective scores obtained on the vocabulary and comprehension tests 
and extrapolating with the aid of Wechsler’s tables, an IQ measure was obtained and 
used as a basis for rank ordering. When compared with the rank order for rated 
potentiai intelligence, correlations of .92 for vocabulary and .81 for comprehension 
were obtained, both significant at the .01 level. Since the dimension of potential 
intelligence involves a rating of the judged overall intelligence exhibited independent 
of the actual correctness of the response as viewed by conventional scoring systems 
and that in a sense this dimension might represent the pre-illness level of intelligence, 
these correlations are surprisingly high. It is possible that the judges’ ratings were 
contaminated by the conventional “right” or “‘wrong”’ nature of the responses, but 
we must remember that the statistic used merely indicates rate of change between 
the two variables and tells us nothing about absolute values. Thus our correlations 
only would indicate that the same people are relatively brighter, ill or well. The cor- 
relations between the ratings of potential intelligence and the external criterion of 
severity were —.71 for vocabulary and —.39 for comprehension, the first significant 
at the .05 level. The correlation between the intelligence measure based on objective 
scoring of the tests and the external criterion of severity was —.71. Considering not 
only the small number of patients and judges but the danger of the extrapolation 
procedure used, these results are suggestive of the further possible use of such sub- 
jective rating scales in the evaluation of the patient’s intelligence. 

Rank order intercorrelations were run between the four dimensions of judgment 
to study their interrelations. The results (Table 1) in general confirm our previous 
findings“ and are consonant with the literature as potential intelligence is not re- 


TasLe 1. Rank CoRRELATIONS BETWEEN SCALES 


Previous Study Present Study 


Scales Vocabulary Comprehension 


Schizophrenia & Potential Intelligence 
Schizophrenia & Communicability 
Schizophrenia & Concrete-Abstract 
Potential Intelligence & Communicability 
Potential Intelligence & Concrete-Abstract 
Concrete-Abstract & Communicability 


* = significant at .05 level 
** = significant at .01 level 


** —.16* 
—.90** 
—.69"* —.45** 
56** 
.52** 
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lated to amount of schizophrenic pathology, communicability is highly inversely 
a and there is a moderate relation between schizophrenia and concreteness in 
thinking. 

Reliability of judgment was investigated by comparing the rank order based 
upon the ratings of each jud_-z for each patient for each test and each dimension with 
the rank order based upon the mean values of the other four judges’ ratings. The 480 
correlations thus obtained duplicate our previous findings® * *. For reasons of 
brevity Table 2 gives the rank order correlation between each judge’s orderings of 


TaBLe 2. or Ratines Over 12 Patients 
(Dirrerent JupcEs ror Dimension) 


Judge 
III 


= 
< 


85 


Comprehension 


Communicability 
Vocab’ 


Comprehension 
Concrete-Abstract 


Vocabulary 
Comprehension 


38 
88 sa Re 


38 ae 


the twelve patients and the mean order of the other four judges. The agreement is 
respectable. As previously found’, the concrete-abstract dimension is least reliable, 
with one judge apparently having genuine difficulty with the task. 

Since our criterion presents 12 cases of schizophrenia graded according to the 
severity of the disorder, it is possible, using the mean ratings of the judges for the 
amount of schizophrenic pathology present in the test responses of each patient, to 
plot the shape of the relationship between the amount of the disorder and the in- 
creasing disturbance in thinking as exhibited in the test responses. Within the limit- 
ations of our data, the progression seems to be an orderly one, and the function to 
be linear. 

SUMMARY 


Twelve schizophrenic patients, rank ordered on the basis of the severity of their 
disorder by three experienced clinicians, were given the Vocabulary and Compre- 
hension subtests of the Wechsler intelligence scale. The test responses were then 
rated independently by other clinicians using a seven-point scale and rating for 
amount of schizophrenic pathology present, potential intelligence exhibited, com- 
municability, and concreteness of thinking. Agreement between the ratings of path- 
ology present and the independent criterion of severity demonstrated the validity of 
the judges’ ratings. The relationships between the dimensions judged, and the overall 
interjudge agreement were in accord with previous findings. 
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VALIDATING CLINICAL JUDGMENTS WITH THE 16 P. F. TEST! 
SAMUEL KARSON 
Dade County Child Guidance Clinic, Miami, Florida 


PROBLEM 

The present study attempts to subject certain aspects of the clinical judgments 
of experienced psychiatric social workers to research scrutiny *: 7: %. Specifically, we 
were interested in determining whether the social work staff of a community child 
guidance clinic that had worked together for a minimum of four years, would show 
any significant amount of agreement with one another as well as with an objective 
personality questionnaire when asked to rate the personality traits of mothers whose 
children were found to be in need of psychotherapy. 

The profile sheet of the 16 Personality Factor Questionnaire was used as a rating 
device in order to accomplish the initial task. Since the 16 P. F. test“) measures the 
major dimensions of personality in rating media and in the questionnaire realm, it 
was believed to be especially appropriate for our purposes”. One advantage of this 
method is that it insures the fact that clinical judgments are confined to predicting 
validly established personality factors. 


METHOD 

Prior to obtaining the actual trait ratings, an orientation meeting was con- 
ducted with the social work staff and the terms used to describe the personality fact- 
ors were discussed and reviewed. Thereafter, the ratings on each of the 16 personality 
factors were obtained from the five members of the social work staff. In making the © 
ratings the workers actually used a ten point scale which was later reduced to a 
three point classification for the statistical analysis. 

After the assumption of equal distribution of categories had been met for the 
group as a whole (df = 4, x? = .93, P = n.s.), a Chisquare analysis was performed 
which compared the obtained number of agreements of each judge with the other 
four against the number of agreements to be expected by chance (df = 4, x? = 
41.56, P = < .01). 

Inspection of the workers’ ratings revealed unanimous agreement among the 
staff in characterizing the clinic mothers as high on factors L (Paranoid Suspicion), 
O (Guilt Proneness), and Q, (Free-Floating Anxiety). Further, four out of five 
workers described these mothers as average on B (General Intelligence), low on C 
(Emotionally Unstable), H (Shy, Timid) and Q; (Lax, Uncertain), and high on I 
(Emotional Sensitivity). Three of the five workers agreed in depicting them as 
average on factors A (Warm, Sociable), F (Enthusiastic), and M (Bohemian, Un- 
concerned), low on N (Naive, Simple) and Q: (Group Dependent), and high on E 
(Dominance) and Q, (Experimenting, Critical). Interestingly enough, factor G 
(Super-Ego Strength) showed the least agreement of all with two workers character- 
izing the clinic mothers as high, two as average, and one as low. 

After it had been ascertained that the social work staff shared a fairly common 
“stereotype” regarding the personality traits of clinic mothers, the next step in the 
study was to test the validity of these assigned ratings by actually performing a 
study comparing a sample of clinic mothers with a suitable sample of controls. The 
purpose here was to determine whether the traits selected by the social work staff 
would actually succeed in discriminating between a sample of mothers of disturbed 
children referred to the clinic for psychotherapy (experimental mothers) and a 
sample of mothers whose children were functioning more satisfactorily (control 
mothers). It was acknowledged from the outset that no really proven criteria for 
testing the validity of the workers’ ratings were available, but the recent positive 
results obtained with the MMPI by Liverant “?, the earlier work of Karson“: ®), and 


1The author is grateful to the entire social work staff of the clinic for their cooperation, and to 
Mr. T. Douglas Haupt for his assistance in the computation of statistics reported in this study. 
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the fact that to date no negative results have as yet been reported in the growing 
literature on the 16 P. F. test, encouraged us to use it as a criterion measure. 


Samples. The experimental group consisted of 129 Caucasian mothers whose children 
had been referred to the clinic, and all of whom were later recommended for treat- 
ment. The mean age of the experimental mothers was 36.59, 8. D. 5.51 and they 
had a mean factor B raw score (General Intelligence) on the 16 P. F. test of 7.36, 
S. D. 2.05. The large majority are middle class as judged by family income, occupa- 
tion of their husbands, and educational attainment. In comparison, the sample of 
117 control mothers had a mean age of 36.40, 8. D. 7.17 and their mean factor B raw 
score was 6.98, S. D. 2.03. This sample had a mean number of years of completed 
education of 13.11, 8. D. 1.93. On the basis of the same criteria used for the experi- 
mental mothers, the controls were also considered middle class. When ¢ ratios were 
computed between the respective age and intelligence means of these two samples, 
no significant differences were obtained. 


Procedure. The test data on the experimental mothers was accumulated between 
October, 1958, and March, 1959 by the routine administration of Form A of the 16 
P. F. test by a member of the secretarial staff to all mothers who applied and were 
accepted for clinic services. The testing was accomplished after the intake interview 
with a psychiatric social worker. 

To obtain a comparable sample of control mothers, arrangements were made 
with the Parent Teachers Association at two of the local public schools. Volunteers 
were requested to participate in a study which would require them to fill out a 
questionnaire. Every volunteer was assured of her anonymity. Each mother was 
asked to place an X in the upper right hand corner of her answer sheet if she had 
ever consulted a psychiatrist or a psychologist for herself or for her children. Less 
than 5% of the sample had to be rejected for this reason. 


RESULTS AND DIscussION 


After all the tests had been scored, the differences between the means on each 
of the personality factors were analysed by ¢ ratios (Table 1). Significant mean 


TaBLe 1. Means, SDs, AnD SIGNIFICANCE OF DIFFERENCE BETWEEN MEANS oF EXPERIMENTAL 
(N = 129) anp Conrrot Moruers (N = 117) 


Experimental Mothers Control Mothers 
Factort Mean SD Mean sD tt 
A 5.91 1.97 6.32 1.90 1.66* 
B 4.48 1.87 4.09 1.83 1.45 
Cc 3.23 2.05 3.52 2.00 1.21 
E 5.80 2.15 5.79 2.11 0.17 
F 3.34 1.84 4.15 2.10 3.15** 
G 4.01 1.70 4.33 1.98 1.58 
H 4.78 1.93 5.16 1.78 1.22 
I 7.33 1.48 7.80 1.65 0.59 
L 7.01 2.02 6.40 3.61 0.30 
M 5.36 1.94 5.09 1.97 1.50 
N 5.19 1.77 5.21 1.82 0.02 
Oo 7.19 1.94 6.27 1.97 3.71%* 
1 6.54 1.96 6.52 1.62 0.90 
‘ 6.47 2.03 5.97 1.94 1.80 
s 4.14 2.17 4.89 2.03 3.23** 
‘4 6.94 3.71 6.27 2.34 2.89** 


All scores are based on sten scores. 

All ¢ values are based on raw scores using a one-tailed test. For significance at the .05 level of 
confidence, t must equal 1.65. 

*Significant between the .05 and .01 levels of confidence. 

**Significant at greater than the .01 level of confidence. 
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differences were obtained on test factors A, F, O, Q;, and Q,. A one-tailed test was 
used in performing the calculations since the majority ratings of the staff social 
workers were used to predict the direction of the differences. In descriptive terms, 
the mothers of disturbed children are depicted as significantly less warm and socia- 
ble, more depressed, possessing stronger guilt feelings, being less successful in their 
efforts to bind anxiety, and suffering from more generalized free-floating anxiety 
than mothers of ‘“‘healthy’’ children. Of these five factors, four were significant be- 
yond the .01 level of confidence, while factor A was significant at the .05 level. A 
comparison of these results with the social work staff ratings revealed that the staff 
as a whole was most in error with regard to factor F, since only two of the five work- 
ers rated the clinic mothers as depressed, while three workers rated them as average 
on this factor. On the other hand, it should be noted that none of the workers rated 
the clinic mothers as high on this factor. The conclusion may be drawn here that the 
clinic mothers are actually even more depressed than the workers rated them and 
this finding served, perhaps, to emphasize the importance of a supporting, reassuring 
role on the part of the staff. 

All of the staff were apparently accurate in their assessment of the strong guilt 
feelings and free-floating anxiety manifested by the clinic mothers as measured by 
factors O and Q,. These results suggest that both of these personality factors may 
be relatively easy to recognize in clinic mothers and again emphasized their intense 
need for reassurance and support by the clinic staff. 

The ratings of four of the five workers were sustained by the findings with regard 
to factor Q; which again strongly supports the ability of the social work staff to 
recognize the feelings of uncertainty in these patients and the need to effectively 
deal with them. 

Factor A was rated as average by three of the workers and correctly as low by 
the other two. Although this result was not predicted by the majority of the staff, it 
_— be noted that none of the workers erred in rating the clinic mothers high on 

is factor. 

Some consideration should be given as to whether these five factors successfully 
discriminated because the clinic mothers have had to face their children’s problems 
and as a result have become more “maladjusted” in their personality makeup, or 
whether actual personality factor differences existed even prior to the onset of 
symptoms in the emotionally disturbed child. Unfortunately, at present we have no 
way of knowing whether the obtained differences are causes or effects. It is our 
opinion, however, that measurable premorbid personality differences do exist which 
theoretically make possible the identification of those mothers whose children will 
need psychotherapy. The actual task of identifying these traits remains a primary 
target for future research. 

What of the clinical impressions which were not sustained by the test results 
i.e., the false-positives? After all, we might well expect that working with a clinical 
population would tend to give the staff a biased view, since little professional con- 
tact is afforded with “healthy” or “average” mothers. For the most part, these fears 
are not supported by the evidence. 

The results presented in Table 1 agree with the findings of Liverant “) which 
support the hypothesis that mothers of disturbed children are more maladjusted 
than mothers of nondisturbed children. The present results also suggest that the 
differences found between the two samples of mothers can probably be satisfactorily 
subsumed under the two major second-order questionnaire factors of Anxiety-vs.- 
Dynamic Integration, and Extraversion-vs.-Introversion®: *). In brief, mothers of 
disturbed children are seen as more anxious and introversive than the mothers of 
nondisturbed children. It is believed that this description might well serve as a 
starting place should a research study be undertaken for the express purpose of 
determining whether mothers whose children will need treatment can be identified 


— to the onset of symptoms in the child, or, for that matter, even prior to their 
ving any children. 
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SuMMARY AND CONCLUSIONS 

The present study attempted to investigate the reliability and validity of the 
clinical judgments of an experienced child guidance clinic psychiatric social work 
staff. The task given the workers of rating the personality traits of mothers of 
emotionally disturbed children was similar to the every day clinical work encounter- 
ed by the staff. The profile sheet of the 16 Personality Factor Questionnaire served 
as a rating scale. The results showed a significant amount of agreement among the 
entire staff with regard to their stereotypes of the clinic mothers’ personality traits. 
In order to test the validity of these trait ratings, certain hypotheses were formu- 
lated on the basis of the majority staff judgments. The 16 P. F. test was adminis- 
tered to a sample of clinic mothers as well as to a matched sample of controls. It was 
found that five test factors successfully discriminated the two samples of mothers. 
Comparisons of the social work staff ratings with the statistical results revealed that 
the most valid ratings were made on the eight factors on which the staff as a whole 
showed the most agreement (at least four out of five), while the least valid ratings 
occurred on the eight factors about which the staff showed the least agreement (no 
more than three out of five). The findings were discussed in the light of present 
knowledge of primary and second-order personality factors. 


REFERENCES 


1. Carrey, R. B., Saunpers, D. R. and Sricz, G. F. The Sixteen Personality Factor Question- 
naire. Champaign, Ill: I. P. A. T., 1602 Coronado Drive, 1957, Ed. 


2. oa > Personality and Motivation Structure and Measurement. New York: World 
3. Hunt, J. McV. On the j t of social workers as a source of information in social work 
research. Use of Judgments as in Social Work Research. National Association of Social Work- 


1959. 

4. KARsoN, S. and Poot, K. B. The construct validity of the Sixteen Personality Factors Test. 
J. clin. Psychol., 1957, 13, 245-252. 

Poot, K. B. Second-order factors in personality measurement. J. consult. 

6. Kamion, S. The Sixteen Personality Factor Test in clinical practice. J. clin. Psychol., 1959, 15, 

7. Kogan, L. 8. Validity, reliability, and related considerations. Use of Judgments as Data in 
Social Work Research. National Association of Social Workers, 1959. 

8. Lrverant, 8. MMPI differences between parents of disturbed and nondisturbed children. 
J. consult, eo 1959, 23, 256-260. 

9. Tarr, R. ility to judge people. Psychol. Bull., 1955, 1-23. 

10. Tuorns, F.C. Clinical judgment: A clinician’s viewpoint. J. clin. Psychol., 1960, 16, 128-134. 


A HOSPITAL ADJUSTMENT SCALE FOR CHRONIC DISEASE PATIENTS 
ALBERT H. URMER, ZENA MALEK' AND LEONARD V. WENDLAND 
Rancho Los Amigos Hospital, ey California 
and University of Southern horton 
PROBLEM AND PROCEDURE 

In the past a number of scales have been published to aid hospital staffs in the 
objective evaluation of the patient’s adjustment. Most of these scales had their 
value limited to psychiatric populations. The large number of chronic disease 
patients has es the need for a scale applicable to them. This study was 
designed to fill this need. 

A scale of 55 items, most of which were based on the Hospital Adjustment 
Scale by Ferguson, McReynolds and Ballachey®) was constructed. The scale was 
administered on 42 chronic disease female patients, by having the scale used by two 
nursing personnel having close contact with the patient (close daily contact with 
two weeks’ observation). The rater was asked to evaluate each item on the basis of 


‘Mrs. Malek participated in this research during the tenure of a Predoctoral Research F i 
from the National Institute of Mental Health, United States Public Health Service, and are ship 
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its being True (T), Partially True (PT), Partially False (PF), and False (F) for the 
patient. All four choices were designated for each item and the rater was asked to 
circle the appropriate index. The ratings were made independently so that any two 
raters evaluating the same patient did not compare their ratings. 

The scoring system consisted of assigning a value of 4 to ratings in the direction 
of good adjustment and a score of 1 for poor adjustment. The two items between 
were scored 2 and 3 respectively, depending on which item (True or False) was 
scored 4. The reliability of the ratings was computed for each of the 55 items using 
Ebel’s“) method. 

To reduce the size of the scale and make it more sensitive, 25 items were taken 
from the above scale. An item was picked on two criteria: (a) It had to exceed the 
.01 level of confidence as to reliability (p .01 = .384), and (b) it had to differentiate 
between the first and fourth quartile rating of the patients as to adjustment. 


REsULTs 
Table 1 shows the resulting 25 items (14 from the original Ferguson scale), all 
of which have a reliability coefficient larger than .50 and differentiate between the 
first and fourth quartile (as to good hospital adjustment) at the .01 level of con- 
fidence using the Mann-Whitney U test. The reliability of the new scale using two 
independent ratings for 42 patients is .94. 


1. Hosprrat ADJusTMENT SCALE 


| a PT PF F The patient ignores the activities around her. 
ye PT PF F The patient get up for visitors. 
3. T PT PF _ F The patient follows events in the daily paper. 
PT PF F The patient laughs if she is kidded. 
PF F The patient stays by herself. 
F The patient doesn’t mix with other patients. 
\ i PT PF F The patient doesn’t seek social contacts with other patients. 
a. = rr PF F The patient can tease another patient back into humor. 
9 T PT  ##PF _ F _ The patient remarks when it’s time for visitors. 
_ = ae PF F The patient doesn’t have close friends on the ward. 
ih. = PT PF F The patient won’t discuss many subjects. 
2 Pe PF F The patient is always chatting with someone. 
SS ee, ae PF F The patient knows the names of the doctors, nurses and aides. 
F The patient writes letters. 
6%. FT PT PF F The patient has to be reminded to attend to routine. 
16. T PT PF F The patient offers to help others. 
PF F The patient yells at attendant when she’s dissatisfied. 
18. T PT PF _ F _ The patient shares with others. 
19. T PT PF F The patient easily becomes upset if something doesn’t suit her. 
20. T ey PF F The patient anticipates activities of next day. 
% F PT PF F The patient takes part in ward games and other recreation. 
22. T PT PF F The patient appears cheerful most of the time. 
23. T PT PF F _ The patient would sit all day if not directed to do an activity. 
24. T PT PF F The patient doesn’t like to get out of bed. 
TFT. F The patient stays neat and clean. 


The range of scores possible is from 25 to 100 with the larger scores indicati 
better hospital adjustment. The total score for the 18 ratings of the poorly adjust 
wey ranged from 25 to 43, with the well-adjusted patient score range from 80 
to 99. 


SUMMARY 

This investigation reports a new scale for rating the hospital adjustment of 
chronic disease patients. The scale is easy to administer and should give hospital 
staffs a fairly objective method of evaluating adjustment as well as changes in 
adjustment. Scores below 50 on this scale tend to indicate poor hospital adjustment 
and scores above 75 good hospital adjustment with the 50 to 75 range being average 
adjustment. 
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PATIENT’S EXPECTANCIES AND NON-SPECIFIC THERAPY AS A 
BASIS FOR (UN)SPONTANEOUS REMISSION 


ARNOLD P. GOLDSTEIN 
Staunton Clinic, University of Pittsburgh School of Medicine 


PROBLEM 


The effectiveness of psychotherapy has been an issue of continuing research and 
discussion @: 5 ¢ 17. 18, 22, 26) in recent years. As Cartwright“) observed, the major 
argument of those who view psychotherapy as an ineffective technique has centered 
on the issue of spontaneous remission. This argument holds that because certain 
investigations have shown that a proportion of maladjusted individuals improve in 
a specified time period in the absence of formal psychotherapeutic intervention and 
because this proportion of ‘‘spontaneous” remissions is equal to or even greater 
than the mean reported proportion of successful cases in psychotherapy, therefore 
the latter could plausibly be classified as spontaneous also. Thus, this position 
holds, there is no ground for attributing special remedial effectiveness to psycho- 
therapy. In addition to other bases, Cartwright has taken exception to this con- 
clusion on the grounds that it clearly implies that results derived using a single 
t of experimental design (matched-control design) are sufficient for resolving 

e issue in question. 

With Cartwright’s position as background, the present paper considers the 

spontaneous remission argument in terms of quite a different set of evidence, that 

ealing with patient expectancies and the placebo effect in psychotherapy. Our 
basic contention, following this line of evidence, is that the remission of symptom- 
atology heretofore termed “spontaneous” is, in fact, quite unspontaneous and a 
function of identifiable causative factors. 

Dictionary definitions of ‘spontaneous’ center around such phrases as “without 
any external stimulus” »- “independent of external agencies’ »- etc. 
In the several investigations ®: 1% 1. 4, 16, 18, 21, 24, 26) of the effectiveness of psycho- 
therapy in which a “therapy” and “no-therapy” group design (matched-control 
design) was utilized, the changes taking place in ‘‘no-therapy” subjects have char- 
acteristically been termed “‘spontaneous.’’ Close scrutiny, however, of the conduct 
of these studies clearly demonstrates that a great deal was done to each of the con- 
trol subjects in the course of the investigation. This would include participation in 
any or all of the following activities: (a) initial intake interview, (b) initial psycho- 
logical testing, (c) a social work interview, (d) periodic and/or post wait-period 
psychological testing, (e) post wait-period psychiatric interview, and (f) interview(s) 
with relatives or friends of control subjects, of which these subjects may be aware. 
Therefore, while the individuals constituting the control group are not participating 
in the type of formal psychotherapy under investigation, it is evident, especially to 
them, that a great deal of attention and professional interest is being focused upon 
them. In this regard, it is important to note that several investigators have taken 
the position that, as Hathaway “* has stated: “. . . almost any form of attention 
given to a patient’s problems, whether skilled and upon good theoretical basis or 
unskilled and poorly conceived, is likely to result in improvement.” “> »- %° Sim- 
ilarly, Frank, Etiedman, Imber, Stone and Nash “ have noted that “. . . any form 
of activity by a person culturally defined as heale= may activate a patient’s belief 
that he is being helped.” ». 

Several studies ‘*: have demonstrated that the effects of psychotherapy, 
particularly with reference to symptom reduction, are a partial function of the 

tentiation and activation of the patient’s favorable expectancies regarding such 
improvement. Professional attention paid to the patient in the form of the initial 


‘Italics mine. 
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interview, psychological testing, etc., while not intentionally therapeutic, may 
function as therapeutic to the extent that it activates these favorable expectancies 
and, in general, imparts to the patient a feeling that something is being done for his 
problems. The present writer suggests that improvement taking place in control 
subjects be conceptualized as analogous to a placebo effect in that such improvement 
appears to be a partial function of these non-specific consultative interventions, by 
culturally defined healers, which the patient may interpret as therapeutic activity. 
As will be seen, evidence exists which permits this conceptualization to be further 
elaborated to the following statement: The extent to which a ‘‘no-therapy” control 
group patient improves while awaiting formal psychotherapeutic intervention is a 
joint function of his degree of favorable expectation of improvement from psycho- 
therapy and the extent to which he receives professional attention via such non- 
specific, placebo-like activities as ‘‘status’” interviews, psychological testing, etc. 
More formally, this might be stated: ‘“Spontaneous”’ remission = f (expectation of 
improvement and non-specific professional attention). Since “‘spontaneous’’ re- 
mission is, on the basis of arguments raised thus far, a misnomer, it is suggested that 
the somewhat more unwieldy but appreciably more accurate term ‘non-specific 
therapy remission’’ be substituted for 

Let us now examine the evidence supporting the formulations presented above. 
Hankoff, Freeman and Englehardt“*) investigated the prognostic implications of 
placebo reactivity with outpatients recently discharged from a psychiatric hospital 
who were placed on a three week trial of placebo administration and subsequently 
classified as placebo reactors or as non-reactors. For each patient, subsequent 
clinical course with regard to rehospitalization was noted. Results of the study 
indicated that placebo reactors maintained their non-inpatient status significantly 
longer than did the non-reactors. The authors conclude: “. . . the ability to have a 
favorable symptomatic response to a placebo appears to be a prognostic sign for 
future success in the maintenance of ambulatory status.’’ 

Gliedman, Nash, Imber, Stone and Frank®) compared the effectiveness of 
short-term psychotherapy and placebo administration for the reduction of symptom- 
atology in psychoneurotics. Not only did symptom reduction take place to a signifi- 
cant degree under both treatment conditions, but the magnitude of this symptom re- 
duction was quite similar in both instances, leading the investigators to conclude: 
“.. . it is felt that the capacity to respond to a placebo should be considered a highly 
desired attribute for recovery . . . The distress relief achieved may be thought of as 
a consequence . . . of the impact of the clinic and of the symbolic role of the physician 
on patients who had expectancies for improvement from this particular source of 
help.” ® ». 4) A study by the present writer“®) pertained more directly to the issue 
of “spontaneous” remission in psychotherapy. 


MeEtTHOD 


In this investigation, periodic measures were obtained on all patients (including 
the 15 “no-therapy” control patients) of their expectation of improvement due to 
psychotherapy and their perceived improvement over the course of the wait period. 
As indicated above, perceived improvement in the control patients can be con- 
ceptualized as non-specific therapy remission. The two measures, ted and 
perceived improvement, were obtained at two and one-half week intervals for seven 
and one-half weeks. Before examining the correlations between these measures for 
the control patients, it should be stressed that these individuals received no formal 
psychotherapy during the investigation, but did receive professional attention in 
the form of an initial intake interview and three psychological testing sessions. 


*Although consideration of faith healing and so called miracle cures is beyond the scope of the 
y—appear be 
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RESULTS 
All correlations (Table 1) are highly significant in the predicted direction. Since 
the non-specific therapy of the intake interview and psychological testing were con- 


TaBLe 1. CorreLanons Berween ExpscTep AND PERCEIVED 
IMPROVEMENT FoR ConTROL-GrovuP 


r 


-649 <.01 
. 864 <.01 
-861 <.01 


stant for all subjects, these results may be interpreted as indicating that a significant 
relationship exists, in the absence of formal psychotherapy, between the degree of 
patient’s expectation of improvement and the degree of non-specific therapy re- 
mission. 

Barron and ®) raised the question, as has the present paper: Can the 
patients on a psychotherapy waiting list be really thought of as untreated? They 
state: 

In of course, simply having committed oneself to icipating i x 
and ha had a reciprocal a clinic to afford 
immediately, represents a breaking of the neurotic circle. A force for change has already been 
introduced. In addition, the initial interview and the ical testing may themselves be psycho- 
therapeutic events, since during such sessions the patient es some efforts to confront hi 
and his problems more objectively than he has in the past. ®: ». #8 


In this study, mean pre-post MMPI scores of patients undergoing both group and 
individual psychotherapy were compared to the equivalent scores obtained from 
individuals who had sought psychotherapy but were, at the time of the study, on the 
clinic’s waiting list. Results indicated that the therapy patients did not improve to 
a significantly greater extent than the wait-list controls. The authors conclude that 
the use of wait-list controls in psychotherapy research may be complicated by 
therapeutic factors in the initial evaluations and decisions. 

Support for Barron and Leary’s conclusion is provided by preliminary results 
of an investigation by the pean: writer now in progress. Subjects are administered 
an inventory dealing with current somatic and psychological symptomatic com- 
plaints immediately preceding and following their initial intake interview. Results 
on the 27 patients tested thus far indicate positive pre-post symptomatic change 
(fewer and /or less severe vane in 25 patients and negative change in but two, 
on both the somatic and psychological dimensions. By means of the sign test, these 
differences are significant at the .01 level and, thus, may be interpreted as further 
evidence indicating the therapeutic nature of consultative interventions. 


Discussion 
It was noted earlier that Cartwright“ has taken the position that an investi- 
gator need not compare the extreme conditions of presence and absence of formal 
psychotherapy in order to yield evidence concerning its effectiveness. The arguments 
presented in the present paper clearly lead to the same conclusion. If one, therefore, 
disavows the characteristic use of the ‘‘therapy’’ versus ‘‘no-therapy” experimental 
design as a means of investigating the effectiveness of psychotherapy, three alterna- 
tive approaches which do not appear to suffer from the above biases appear appro- 
priate. 
Own-Control Design. In this approach to psychotherapy research, essentially two 
conditions are imposed upon the patient: (a) a wait-period during which he receives 
no formal psychotherapy but does receive the professional attention herein termed 


‘Italics mine. 


|_| 
Time of Testing 
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non-specific therapy, and (b) an equivalent time period of formal psychotherapy in 
which, in addition to this formal therapy, the consultative interventions are re- 
peated. Since the non-specific therapy in both the “wait” and “psychotherapy”’ 
periods are basically equivalent, this design can be viewed as controlling for non- 
specific therapy effects and any differential of improvement during the psycho- 
therapy period over the wait period can safely be attributed to the intervention of 
formal psychotherapy. 


Manipulation of an In-Therapy Variable. This design, while differing from the own- 
control design in certain basic characteristics, is similar to it in that the patients 
undergoing all treatment conditions receive equivalent amounts of non-specific 
therapy. Unlike the own-control design, patients in the present case all participate 
in formal psychotherapy. This formal psychotherapy is ideally identical for patients 
in all groups with the exception of the single aspect of the therapy which is being 
manipulated. If one is interested, for example, in examining the effectiveness of 
psychotherapy as a function of degree of therapist experience, depth of interpreta- 
tion, therapist directiveness, or any other variable, one can systematically vary 
these factors and thus determine if this variation has a differential effect on patient 
improvement. As Cartwright“ has noted, utilization of this design can shed light 
on the basic question of the effectiveness of psychotherapy. He states “... if it isa 
true statement that ‘as a result of manipulating certain independent variables, 
psychotherapeutic intervention is differentially effective in producing recovery from 
neurosis,’ then the simpler statement that ‘psychotherapeutic intervention is 
effective in producing recovery from neurosis’ is also true.’’ “: P. 2%) 


Placebo Psychotherapy Design. This approach to psychotherapy research, originally 
suggested by Rosenthal and Frank» involves a modification of the matched- 
control or “therapy” versus ‘“‘no-therapy” design which controls for non-specific 
therapy effects. In lieu of a “therapy” and a “no-therapy” group, this design em- 
ploys a formal psychotherapy and a placebo psychotherapy group. The formal 
psychotherapy group members participate in “‘standard’’ psychotherapeutic sessions 
and the consultative, non-specific interventions. The placebo psychotherapy 
patients, in addition to these non-specific interventions, participate in a regular 
activity which is likely to be perceived as formal psychotherapy by them, but which . 
is not formal psychotherapy from the theoretical point of view of the psychothera- . 
peutic system under investigation. This would be the case, for example, if one were 

investigating the effectiveness of psychoanalytic psychotherapy by having the formal 

psychotherapy patients participate in psychoanalytic psychotherapy and the placebo 

psychotherapy patients in desensitization therapy. Since both groups of patients 

would likely feel they were participating in formal psychotherapy and since the 

amount of nonspecific therapy would be the same in both cases, any increment in 

improvement in psychoanalytic patients over desensitization patients could justi- 

ably be attributed to the efficacy of psychoanalytic psychotherapy. ” 


SUMMARY 


The present paper attempted to demonstrate the unspontaneous basis for what 
heretofore has been termed “‘spontaneous” remission. Evidence is presented thata ‘' 
combination of favorable patient expectancies and such nonspecific professional 
intervention as the intake interview and psychological testing are sufficient for in- 
ducing symptomatic change in individuals waiting to participate in formal psycho- 
therapy. Because of this, the “therapy” versus “no-therapy” experimental design _—- 
often used in evaluations of the effectiveness of psychotherapy appears not tobethe ‘' 
comparison of improvement in the presence and absence of psychotherapy which its 
proponents suggest. Three experimental designs not subject to these biases are dis- 
cussed. Finally, it is suggested that ‘‘non-specific therapy remission” be substituted 
for “spontaneous remission’ as a means of describing improvement in wait-list 
patients not participating in formal psychotherapy. 


ox 
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PATIENT’S EXPECTANCIES AND NON-SPECIFIC THERAPY 
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MMPI CORRELATES OF WARD PLACEMENT AMONG 
STATE HOSPITAL PATIENTS! 


LLOYD K. SINES AND REUBEN J. SILVER? 
University of Minnesota Medical School 


In most custodial or continued treatment pee units, patients are gen- 
erally grouped or classified for domiciliary or other purposes on the basis of ostens- 
ible characteristics such as proneness to elopement, the presence of anti-social 
trends, or other evidences of management risk. Such classification of patients is 

enerally accomplished on the basis of clinical judgment supplemented by case 
ar material which may or may not include psychological test data. This paper 
reports data on psychological (test) correlates of differential ward placement of 
patients in a continued treatment psychiatric setting. 


PROCEDURE 


All patients on the continued treatment units at the Fergus Falls State Hospital 
who passed a screening test for reading and were amenable to psychological testing 
were given the MMPI. Data obtained from patients on three wards from each of the 
male and female units which are representative of typical behavioral classes found in 
such settings will be reported. Specifically, an open ward, a closed ward housing 

atients engaged in some form of industrial therapy (work), and a “disturbed” or 
em unit will be contrasted. There were 316 male patients on the three 
wards, of which 144 were assigned to the open ward, 83 to the closed workers’ ward, 
and 89 to the hyperactive ward. There were 295 patients in the total female sample, 
of which 89 resided on the open ward, 115 on the closed workers’ ward, and 91 on 
the hyperactive ward. The age range for the total male group was from 18 to 76, 
with a mean of 48.8 and a standard deviation of 11.5 For the total female group the 
e range was from 19 to 78, with a mean of 47.3 and a standard deviation of 10.9. 
ere was some suggestion of a relationship between age of patient and ward place- 
ment, with the hyperactive units having the lowest mean ages. 


RESULTS 

Mean MMPI subscale scores for the three male samples are reported in Table 1. 
The Hathaway codes“? for the mean profiles are as follows: for the open workers’ 
ward 8'624, for the closed workers’ ward 86'27, and for the hyperactive ward 86'49. 
The profile configurations for the three wards are quite similar, though the mean pro- 
files for closed wards show greater elevation of subscales than does the mean profile 
for the open ward. Both closed ward samples show significantly greater elevation on 
scale F, which is generally interpreted to reflect the tendency to report bizarre 
symptomatology, and is also suggestive of disorganization of thought processes. On 
no other subscales do the closed ward mean profiles differ significantly from the mean 
profile for open ward patients. However, the mean profile for the hyperactive ward 
differs significantly from both the open and closed workers’ ward mean on scale 6 
(Pa), and from the open workers’ ward mean on scale 8 (Sc). There would appear 
then, to be a trend toward greater elevation of subscale mean scores and incre: 
disorganization of thought processes with paranoid mentation, among patients 
housed on closed wards. The mean profile codes indicate that while MMPI con- 
figuration is quite similar on diverse male chronic wards, elevation of subscales var- 
ies, suggesting that closed ward patients are behaviorally and psychiatrically more 
disturbed than open ward patients. 


1This study was supported in part by a grant from the Department of Public Welfare, State of . 
*The study was completed while the investigators were on the staff of the Fergus Falls State 
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In general, the trends observed among male continued treatment profiles are 
replicated in the female sample (Table 1). However, with regard to the closed 


TaBLE 1. Mean MMPI Svusscaz Scores For THREE Types oF MALE AND FeMaLE WaRDS 


Closed Workers’ Hyperactive oon Closed Workers’ H ive 
ale Male Male Female Female ‘emale 
(N = 78) (N = 55) (N = 23) (N = 67) (N = 63) (N = 16) 


7.51 5.19** 
12.37 21.44** 
13.75 
61.00 


o 
lo} 


| 


6.49 
15.19 
16.02 
56.98 

59.95 
56.21 
65.62 
57.95 
72.44 
62.57 
75.90 
64.30* 
56.75 
40.71 


*reliably different from open ward mean (p < .01) 
**reliably different from open ward mean (p < .05) 


workers’ ward, only in the case of scale 9 (Ma) is the difference from the open ward 
mean statistically reliable (p < .05), suggesting that patients on the closed workers’ 
ward are behaviorally more active. While the mean subscale scores for hyperactive 
ward patients do not differ reliably from the mean scores for the closed workers’ 
ward, there are several scale differences between the former and open ward mean 
scores. The differences in mean F and Ma scores between the hyperactive and open 
ward patients are statistically reliable (p < .01), as is the difference observed on the 
Sc scale between the two groups 4 < .05). These subscale mean differences suggest 
that hyperactive ward patients show greater disorganization of thought processes 

a greater tendency to report bizarre symptomatology, and a higher level of behavioral 
activity. The meaning of the difference on L is somewhat obscure, and in fact is in 
the reverse direction from that reported by the present investigators for state hos- 
pital samples of different chronicity and (presumably) severity characteristics. In 
the latter study“) it was found that higher L scores were associated with greater 
psychopathology, and that patients with a diagnosis of neurosis or behavior disorder 
tended to score lower on this scale. The fact that the mean L score among hyper- 
active patients was reliably lower (p < .05) than that of open ward patients runs 
counter to the suggestion that the L scale is positively related to degree of psy- 
chopathology and /or behavioral disturbance. 

The Hathaway codes for the three female mean profiles are as follows: for the 
open ward mean profile 86’42, for the closed workers’ ward mean profile 86’49, and 
for the hyperactive ward 869’4. While all three codes reflect a moderate to severe 
degree of psychiatric and behavioral disturbance, the code for the mean hyperactive 
ward profile reflects the greatest degree of disturbance and would generally be inter- 
preted as indicative of a grossly disorganized, “hyperactive” clinical picture. 


Discussion 

The data reported here indicate that there are psychological correlates of differ- 
ential ward placement of chronic psychiatric patients in the state hospital setting. 
These data supplement an earlier report of objective behavioral differences associated 
with such classification of patients as reflected in ratings by ward personnel “). While 
it might be argued that ward assignment of patients might be influenced by the same 
ward personnel who provide behavioral ratings, the evidence presented here that 
ward placement of patients has objective psychological correlates (as measured by 
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the MMPI) is especially impressive because ward placement or transfer of patients 
was in no way determined or influenced by the psychological test data. That is, in 
no way could the MMPI data in the present study be construed as contributing to 
the fact of ward placement or transfer of patients, since the MMPI data were gather- 
ed ‘after the fact of such placement or transfer. The relationships reported, then, 
between the MMPI and the ward status of patients may be viewed as reflecting 
actual psychological correlates of the objective behavioral fact. 

Differences among the ward samples noted in terms of MMPI characteristics, 
it should be noted, are consistent with the usual characterizations of the behavioral 
and psychiatric status of patients therein. It would appear that as one progresses 
from open to closed wards there is a trend toward a disintegration of behavioral and 
thought patterns, the reporting of more bizarre psychic symptomatology and, in 
general, greater psychiatric impairment which is reflected by both specific MMPI 
subscale values and the mean profile code. The latter findings should be qualified to 
indicate that it is generally elevation rather than configuration of the MMPI which 
varies as a function of ward placement. 

Finally, it is especially noteworthy that the MMPI is sensitive to behavioral 
and psychological correlates of ward placement even though a very important 
selective factor was operating in the obtaining of the MMPI data. That is, it was 
noted on all wards that the most severely disturbed (behaviorally or psychiatrically) 
patients were unable to take the MMPI. While relatively few patients on the open 
ward or on the closed workers’ ward failed to complete the MMPI for this reason, 
only a minority of the hyperactive ward patients were amenable to MMPI testing. 
Thus, completing the MMPI to some extent selects patients in terms of at least 
minimal behavioral integration and psychological accessibility. Specifically, it was 
noted that as L-M score®) decreased (toward less behavioral integration) the pro- 
portion of patients completing the MMPI also decreased, and that below an L-M 
score of 2.25 relatively few patients complete the MMPI. The actual percentage of 


patients able to complete the test was 76 for the open ward, 55 for the closed worker’s 
ward, and 17 for the hyperactive ward in the case of females, and 64, 51, and 26, 
respectively, among males. Therefore, with even an attenuated behavioral range it 
would appear that the MMPI is sensitive to psychological differences among 
chronic psychiatric patients which accompany differential ward placement. 


SuMMARY 

All continued treatment unit patients at the Fergus Falls State Hospital who 
were amenable to psychological testing were given the MMPI. Profile character- 
istics, including subscale mean scores and mean profile codes for three representative 
types of wards from the male and female sections of the hospital were compared. 
It was found that subscale scores tended to increase from open ward, to closed 
workers’ ward, to hyperactive ward, and that the mean profile codes reflected ele- 
vational (though not configural) differences among the wards. 

It was also noted that only the more integrated patients (as reflected by gross 
behavioral ratings) were able to complete the MMPI and, since the proportion of 
testable patients decreased from open to hyperactive wards, it is quite likely that 
such selection considerably attenuated the psychological differences which accom- 
pany differential ward placement of patients. The MMPI differences noted among 
wards were compatible with the behavioral and psychiatric characteristics usually 
ascribed to such groups, with the open ward groups obtaining the least pathological 
mean profiles, and the hyperactive groups the most “disturbed” profiles. 
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THE EFFECT OF PATERNAL FAILURE TO PARTICIPATE DURING 
THE REFERRAL OF CHILD PSYCHIATRIC PATIENTS 
LUCIANO L’ABATE 
Washington University Medical School 
PROBLEM 

This study examines whether the absence or presence of the father at the time 
of a child’s referral to a mental health clinic is a significant variable in the mother’s 
emotional disturbance and in the child’s maladjustment. One would expect to find a 
greater degree of emotional disturbance in the mother and greater maladjustment in 


the child when the father is absent. If the father is present, the degree of emotional 
disturbance in the mother and maladjustment in the child should be less. 


METHOD 


Experimental Samples. Twenty-one couples of parents and 27 mothers not ac- 
companied by their husbands who came to a mental health clinic for behavior prob- 
lems in their child were administered the MMPI. These parents were all in their 
thirties and from lower to middle socio-economic semirural background. Nineteen 
of their children, all boys between 9 and 13 years of age, were individually admin- 
istered Rogers’ Test of Personality Adjustment as well as other clinical tests. Nine 
of these boys came with both parents, and 10 boys came with their mother only. 


Control Samples. The MMPI profiles of 49 couples of parents of comparable back- 
ground and age as the experimental parents were taken from Liverant’s recent 
study“). Forty-nine boys without behavior problems and of similar semirural and 
geographical background and age (between 9 and 13) as the experimental boys were 
administered Rogers’ Test in groups of 5 or 10. 


RESULTS 


The experimental mothers as a whole group (N = 48) differed from the control 
mothers on the MMPI, obtaining a significantly higher mean in the number of items 
left unanswered, and a lower K in the validity scales. In the clinical scales, they 
showed higher means on the neurotic triad and on Pt (p < .05)“). However, when 
the experimental mothers were divided into two groups, those who came to the 
clinic with their husband (N = 21) and those who came without their husband 
(N = 27), the differences between the MMPI means disappeared. The latter group 
of mothers differed from control mothers only on the D and Pt scales (p< .05) even 
though maladjustment for the three groups was in the predicted direction. 

The experimental fathers differed from control fathers in a greater number of 
items unanswered (p < .01) and higher means on Pa and Sc scales (p < .05). Other 
scales did show higher means for the experimental fathers, but the large SDs of the 
experimental fathers precluded further statistical testing of the differences between 
means. Furthermore, the MMPI profiles of the two groups of experimental mothers 
whose boys had been administered the Rogers’ Test were compared. The direction 
of a greater emotional disturbance in mothers without husbands still existed al- 
though the small number of mothers in both groups did not allow a more rigorous 
statistical validation. 

_ The direction of greater maladjustment for boys who came to the clinic without 
their father is also in the predicted direction. They were more maladjusted than 
control boys in personal inferiority (PI), social maladjustment (Sm), daydreaming 
(Dd), and the total maladjustment score. This group of boys were in turn more 
maladjusted than the group of boys who came to the clinic with both parents. How- 
ever, the latter group differed significantly from control boys only on the social 
maladjustment scale, a difference which affected markedly the total maladjustment 
score. No significant differences in family maladjustment (FM) were present in the 
three groups (Table 1). 
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TaBLE 1. MeEan Rogers’ Test ScorEs AND SIGNIFICANCE OF DIFFERENCE BETWEEN MEANS FOR 
EXPERIMENTAL AND ContTROL Boys 


PI SM 
Mean SM| Mean SD 


9.79 14.59) 4.63 


FM 
Mean SD 


7.77) 3.40 


Total 
Mean SD 


34.00} 6.52 


Control Boys N = 49 3.54 


Experimental Boys with 
both Parents N = 9 


Experimental Boys with 
Mother only N = 10 


11.11) 3.55 4.30} 9.33) 3.08 40.89 


18.44 6.49 


15.10| 5.54|17.40| 4.54) 9.20) 4.85 8.78 


P 2 P 2 P 


Control vs. Experimental 
Boys with Parents 


Control vs. Experimental 
Boys with Mother only 


2.11 1.09 -003 


1.90 -001 


SUMMARY 

The aim of this study was to examine whether the presence or absence of the 
father during the child’s referral to a mental health clinic was a significant variable 
affecting the level of emotional disturbance in the mother and of maladjustment in 
the child. According to differences in MMPI profiles, mothers who brought their 
child to a mental health clinic without their husbands appeared to be more dis- 
turbed than control mothers and mothers who were accompanied by their husbands 
when they brought their child to the same clinic. The same differences found among 
mothers held between two groups of experimental boys and a group of control boys 
according to their scores on Rogers’ Test of Personality Adjustment. Boys who were 
accompanied only by their mothers appeared to be more maladjusted than boys who 
were accompanied by both parents. Both groups of experimental boys appeared 


more maladjusted than a group of control boys. 
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CHANGES IN DIRECTION OF HOSTILITY RELATED TO 


INCARCERATION AND TREATMENT 


LAWRENCE A. BENNETT 


Californi C 


One of the implicit assumptions underlying the 
rationale for the incarceration of delinquents is 
that somehow a period away from normal society 
will bring about changes in attitude and behav- 
ior, leading toward more adaptive ways of dealing 
with frustration. However, there has been little 
or no investigation of the relationship of the 
length of incarceration to changes of attitude. 
Recently there has been much emphasis upon the 
modification of attitudes through counseling or 
psychotherapy. The efficacy of such procedures 
remains to be demonstrated. 


AND 


ALVIN RUDOFF 


Pilot Intensive Counseling ‘9 ation 
Deuel Vocational Institution, Tracy, Calif. 


Hostility is readily observable in incarcerated 
inmates and, in view of its apparent relationship 
to “‘acting out’’ behavior, can be considered one 
of the major attitudes that must undergo modi- 
fication if more adaptive behavior is to result. 
Peizer“) examined differences in the direction of 
expressed hostility between groups of inmates at 
Ohio State Reformatory incarcerated for varying 
lengths of time. Utilizing the Rosenzweig Picture 
Frustration Test), he tested groups at the end 
of one year of incarceration oh after three years 
incarceration. He found that there was less use 
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made of extrapunitive responses by individuals 
incarcerated for more than three years than by 
individuals incarcerated for less one year. 
On the other hand, the group incarcerated for the 
longer period made much greater use of the intro- 
— responses than did the newly-confined. 
hus, he was able to support his hypothesis that 
the direction of hostility tended to change from 
“extrapunitive” towards “intropunitive”’ 
time as a result of adaptation to incarceration in 
a correctional setting. These findings were con- 
sistent with observations of incarcerated inmates 
and could be reasonably explained in terms of the 
theoretical framework developed by Rosenz- 
weig®), During the initial phases of adjusting to 
incarceration, inmate’s hostile attitudes in- 
crease. His characteristic method of handling 
tensions and hostilities is to express them be- 
haviorly, but now institutional restrictions mark- 
edly il his use of this mechanism. Thus, he 
turns these feelings inwardly with the resultant 
behavior taking the form of vague hypochond- 
riacal complaints, hysteroid tendencies, emphasis 
on body building, etc. However, after some time 
has elapsed, he becomes aware that this behavior 
is maladaptive and he then becomes more accept- 
ing of conditions and hi , reacting in a more 
mature manner. 

The present study evaluates the applicability 
of Peizer’s findings to the more youthful offender 
serving shorter sentences. dditionally, the 
effects of intensive treatment upon both the 
direction and rapidity of this change will be con- 
sidered. Within the context of the Rosenzweig 
conceptional framework and the Peizer study, 
the following hypothesis will be examined: 


a. Thelo the incarceration, the ter 
the changes from 
intropunitiveness tow impunitiveness. 


b. Treatment, as operationally defined, will 
accelerate the rate of this change. 


PROCEDURE 

Three hundred inmates were randomly select- 
ed from a population of young male adults (17- 
21) incarcerated at Deuel (Calif.) Vocational In- 
stitution. One hundred of these a were 
tested shortly after their arrival at institu- 
tion; one hundred more were tested after six 
months of incarceration; and finally, one hundred 
were tested after having been incarcerated for one 
year. Each of these three groups was sub-divided 
into two groups. The intake group was divided 
into matched groups as no intensive treatment 
had been introduced at this point, the other 
groups were equally divided with one receiving 
intensive treatment and the other receiving the 
usual institutional program. Assignment to 
treatment and control groups had been made on a 
random basis as a part of a broader on-going re- 
search program “), 

Intensive treatment consisted of at least once a 
week individual contact with a trained social 
experimental and control groups participated in 
the total rehabilitation program of the institu- 
tion, including academic education, vocational 

ining, religious counseling, lay group counsel- 
ing and recreational activities. member of 
the various groups was tested with the Rosen- 
zweig Picture tion Test scored in the 
stan manner ®) 


AND Discussion 
A two-way classification analysis of variance 
was performed for each of the three scoring cate 
ries of the test (see Table 1). The ts in- 
icate that time had no effect on extrapunitive 
responses, some ible effect on intropunitive 
responses (p = .05) and a significant effect on 
impunitive nses (p = .01). Table 2 illus- 

trates the ts of the analysis of mean t- 

age scores of the various groups in of the 


1, ANALYSIS OF VARIANCE 


Source of Variance 


Time 
Treatment 
Interaction 
Within 


Intropunitive Category 


of Squares d. f. M.S. F 
Extrapunitive Category 
167 83.5 .36 
67 67.0 .29 
93 46.5 .20 
68,175 231.9 
Total 68,502 
Time 660 2 330.0 3.42* 
Treatment 194 1 194.0 2.01 
Interaction 365 2 182.5 1.89 
Within 28,361 294 96.46 
Total 29,580 299 
Impunitive Category 
Time 1,420 2 710.0 7.01** 
Treatment 64 1 64.0 -63 
Interaction 94 2 47.0 .46 
Within 29,781 294 101.29 
Total 31,359 299 
“Significant at the .05 level 
**Significant at the .01 level ‘ 
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TaBLe 2. Mean Percentace Scores By Direction, 
anp Group 


Groups Intake 


Six Months One Year 


Extrapunitive 


tal 38.09 


Control 38.09 


(E) 
38.66 


Intropunitive (I) 


Experimental 
Control 


32.08 
32.08 


29.68 
34.42 


Impunitive (M) 


Experimental 
Control 


31.98 
29.46 


categories. The largest difference ap in the 
comparison between intensively and not intens- 
ively treated groups at the six month period in 
the sete ae pty category. A t-test of the differ- 
ence resulted in a p of .05. Intensive treatment 
appears to effect an earlier reduction of intro- 


punitive responses. 

Thus, the initial hypothesis that length of in- 
carceration is related to modification of hostile 
attitudes is partially supported. While there is a 
decrease of extrapunitive responses over time, it 
is not great enough to be statistically significant. 
The decrease in the use of intropunitive responses 
was great enough to tend towards statistical sig- 
nificance. The increase in impunitive responses 
through time, brought about by the combined 
reduction in the other two areas, reached a level 
of acceptable statistical significance. 

The second hypothesis that the application of 
intensive treatment will accelerate the progress 
through the various stages, gained but little sup- 
port. There is some suggestion that intensive 
treatment interacted in a itive manner with 
time in bringing about an increase in the use of 
impunitive responses. There is also some 
tion that intensive treatment brings about an 
earlier reduction in the use of intropunitive re- 
sponses. 


The effect of length of incarceration and inten- 
sive treatment on youthful offenders was explored 
~~? an analysis of the scores on the Rosenz- 
weig Picture Frustration Test. The results sug- 


tt that the longer the offender remains in the 
institution, the more likely he is to increase his 
use of impunitive responses to frustrating situa- 
tions. Intensive treatment, as operationally de- 
fined, seems to have little effect. Extrapunitive 
responses do not en to be affec by the 
presently measures length of incarceration and/ 
or intensive treatment. Intropunitive responses 
tend to decrease with time. Treated and control 
groups at the six month period of incarceration 
indicated a differential response suggesting that 
treatment contributes to the more rapid re- 
duction in the use of intropunitive reactions to 
frustration. 
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INTERPROFESSIONAL AWARENESS OF ROLES! 
CYRIL R. MILL? 
Department of Mental Hygiene and Hospitals, Richmond, Virginia 


PROBLEM 

What attributes are necessary for an efficient, smoothly functioning team? As a 
paradigm for such a unit, one might consider the team at work in an operating room, 
or asymphony orchestra, or an anti-aircraft battery. As these units swing into action 
one is aware of the leadership skills that are used, the skill and technique of each 
team member, and the coordination which goes into their performance. There is a 
division of labor which through the countless repetitions of training and practice en- 
ables each team member to know his exact role and also to have a fairly complete 
understanding of the role of the other team members. 

In a hospital the picture is complicated by size and numbers so that it is un- 
realistic to expect the same coordination among staff members as that displayed in a 
small group. However, it can be assumed that the staff will carry out its functions 
with greater efficiency, provided that each employee has a good knowledge not only 
of his own job but also of the jobs of his co-workers. This investigation was designed 
to determine how well each mental health profession could describe its own job and 
the jobs of others who work in the same setting. ye 


PROCEDURE 

Information was obtained by direct interviews with the physicians (N = 63), 
psychologists (N = 24) social workers (N = 30), nurses (N = 55), and attendants 
(N = 122) of six mental hospitals and one training school for the mentally retarded 
in Virginia. Intact groups, that is, the entire available staffs, were interviewed in 
the case of the physicians, psychologists, and social workers. Because of the larger 
number of employees in the fields of nursing and attendants, a sample was selected 
from these groups which covered all levels of positions and most of the departments 
of the hospital. 

The data were collected by the psychologists of each hospital after they them- 
selves had been interviewed by the chief psychologist, who was excluded from the 
study. Each subject was asked ‘‘What does a (physician, psychologist, etc.) do in a 
hospital like this?”’ Interviewers could probe in a non-leading fashion for the most 
complete job description that the subject could give, but they could not suggest any 
form or substance of reply. : 

The verbatim replies were rated on a 5-point scale by a team of psychologists 
who first worked out criteria for judging. The general rating scale was as follows: 

5 points—an all-inclusive description, with specifics in detail, such as would be 

given by an expert in describing his own profession. 

4 points—a good description, but missing some important features. 

3 points—a few general statements and a few specifics, but incomplete. 

2 points—one generality or very limited specific. 

1 point —no knowledge of the profession, or erroneous knowledge. 


With the aid of specialists in each profession this general rating scale was then 
filled in with details applicable to each field. For physicians, a five-point description 
would have to mention the three areas of medical responsibility, supervisory or ad- 
ministrative responsibilities, and duties related to therapies such as chemotherapy, 
psychotherapy, shock therapy, etc. For psychology, the most acceptable descri 
tion should mention the five areas of diagnostic evaluation, psychotherapy, research, 


1Portions of this paper were read by Arthur Centor at the annual meeting of Southeastern Psycho- 
cal Association in Atlanta, Ga., in March, 1960. 
*This investigation was planned and carried out with the cooperation of the psychologists em- 
ployed in the hospitals and training schools of Virginia, 1959. 
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in-service training of staff, and administration. For social work, mention was re- 
quired of obtaining social histories, working with patients’ families, giving assistance 
to patients in relation to home problems and rehabilitation, and acting as liaison for 
home communities and other agencies. For nursing, mention must be made of 
medical and nursing care, administration and supervision of a unit or of personnel, 
and the training of attendants. For the attendant, a description should include 
supervision of a ward and direct patient care, with responses rated qualitatively ac- 
cording to the amount of detail included in the response. The resulting ratings pro- 
vided the numerical data from which the following findings were drawn. 


RESULTS 


Table 1 shows the mean ratings of job descriptions given by the total sample. 
The natural expectation that each profession would rate highest on its self-descrip- 
tion did not entirely hold up. Psychologists, social workers, and attendants de- 


1. Mean Ratings or Jos Descriptions Given BY Em ‘LOYEES 


Profession Described 


Described by Physician Psychology Social Work 


Nursing Attendant All others 


Physicians 


3.63 3.61 3.67 3.46 3.28 3.51 
Psychologists 3.50 4.41 4.12 3.50 3.75 3.72 
Social Workers 3.66 4.23 4.46 3.36 3.63 3.72 
Nurses 3.14 3.60 3.75 3.92 4.01 3.63 
Attendants 2.55 2.64 2.38 2.74 3.41 2.58 
Mean 3.30 3.70 3.68 3.40 3.62 
8. D. 1.01 1.08 a 1.06 1.05 


scribed their own role better than others, but physicians rated higher on their de- 
scriptions of social work than on their own role, and nurses rated better on descrip- 
tions of the work of attendants than on nursing. The most adequately described pro- 
fession was psychology, with social work coming next followed by attendant, nursing 
and physician in that order. 

The top ranking of psychology in Table 2 comes as a surprise, since this pro- 
fession so often feels that it is misunderstood“. The mean rating of the descriptions 
of psychology was significantly higher (.01 = level) than the mean of the job of the 
physician, and nursing. The difference between mean ratings on social work and 
physicians, and social work and nursing was equally significant, as was also the 


Taste 2. Rank oF Megan Ratinas 


Mean 
1. Psychol 3.70 .063 
2. Social Work 3.68 .071 
3. Attendant 3.62 .061 
4. Nursing 3.40 -062 
5. Physician 3.30 .059 

Comparisons of Difference Between Means 
Diff. t P 

Psychology and Physician -402 4.67 <.01 
Psychology and Nursing .302 3.43 <.01 
Psychology and Attendant .082 .64 
Nursing and Physician .100 1.16 
Attendant and Physician .320 3.77 01 
Social Work and Physician .380 4.13 <.01 
Social Work and Nursing . 280 2.96 01 
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difference between the ratings on the jobs of attendants and physicians. The job of 
the physician was least adequately described. 

How well can each sodlantek describe the other professions? The last column 
of Table 1 shows the mean ratings of descriptions by each profession of all others, 
with self-descriptions excluded. Psychologists and social workers share the top rank 
in adequacy of descriptions of others. Nurses come next, followed by physicians 
and attendants. 

Finally, the data of Table 1 provide a measure of “social distance” if we look for 
the lowest mean, or the profession most poorly described by each profession. Physi- 
cians described the work of attendants less well than they described any other. 
Psychologists were least adequate in describing the jobs of physicians and nurses. 
Social workers’ poorest descriptions were of nursing. Nurses, surprisingly, were the 
most inadequate in their description of the work of physicians. And attendants were 
least well acquainted with social work. 

According to these results, much remains to be done before role-definition in 
mental hospitals reaches the point where people can work efficiently with one an- 
other without treading on each other’s toes. As mentioned earlier, a good team is 
made up of persons who know their own job and also the function of the other team 
members. The question arises, who is to draw the line between professions? Who 
determines the area of funtcion and says “beyond this you are getting out of your 
field?” The corollary question must also be asked: “Is it desirable to set up bound- 
aries or is it best to tolerate some strife as each profession settles into whatever group 
of tasks it finds most comfortable?” 

This study suggests that the long and self-conscious effort of psychology and 
social work to define their role both to themselves and to others may be beginning 
to pay off. Certainly the other professions will be better able to use our services if 
they have a clear understanding of what we do. 

This study points up the ambiguous position of the non-psychiatric physician 
in a mental hospital. Many of them are trained outside the United States. They 
come into the mental hospitals and with little orientation are asked to function in a 
variety of roles, medical, administrative, and psychiatric, which may be consider- 
ably removed from their previous experience. Since they work in a mental hospital, 
they are usually seen as psychiatrists by patients and the public. Perhaps the low 
rank obtained by physicians and on job descriptions of physicians indicates that 
more effort should be expended in acquainting them and others of their role and 
their relationship to other professions. 

Nursing also is in need of interpretation to colleagues, and perhaps additional 
self-examination. Nursing, today, is often a far cry from the direct bedside care 
pictured in vocational recruitment posters. Psychiatric nursing, particularly, has 
moved into areas of administration, supervision, and the training of aides. Con- 
fusion and relative lack of information about the nursing profession by nurses and 
their co-workers is clearly revealed in the mean ratings of this study. 


SUMMARY 


In this study, job descriptions given by employees in five hospital professions 
were rated and compared. The results suggest that psychology and psychiatric 
social work are understood better than the other professions. The jobs of the physi- 
cians and nurses are least understood. Psychologists and social workers also appear 
to have the best understanding of the other professions, while the physicians show 
the least understanding of the jobs of their co-workers. 


Implications of these results for efficient team-work and coordination within 
hospitals are discussed. 
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THE COLORED PROGRESSIVE MATRICES AS AN INDICATOR OF 
BRAIN DAMAGE* 


CHARLES W. DILs! 
Longview State Hospital 
Cincinnati, Ohio 


PROBLEM 


A pilot study by the author has indicated that the Colored Progressive Matrices 
(CPM), through the utilization of scatter analysis, may be useful in discriminating 
between brain damaged and non-brain damaged persons. On the same study, a 
group of psychotics showed a wide differential in CPM scores and could not suc- 
cessfully be discriminated from the other two diagnostic groups. The present study, 
eliminating psychotic groups, compares the performance of organics and non- 
organics on the Colored Progressive Matrices. 

At least three previous studies“: *: 4) have demonstrated the sensitivity of the 
regular Progressive Matrices (1938 form) to brain damage. It has been our exper- 
ience, however, that the regular Matrices is too lengthy (60 items) and too difficult 
for general clinical use. A shorter (36 items), less difficult and more attractive ver- 
sion, the CPM (1947 form), is more suitable for testing persons suspected of having 
impaired intellectual functioning. 


METHODS 


Subjects. Two groups of 60 hospitalized Ss were selected with approximately 
75% of each group being psychiatric patients at a mental hospital and 25% being 
medical patients at a long-term hospital. The two groups were controlled for factors 
of age, sex, race and education, and there were no significant differences. The age 
range was 18 to 60. All Ss were literate, had at least 2 years of schooling, and had 
vision adequate to perceive the test stimuli (determined by the ability to read 4 inch 
numbers). None were included who had evidence of idiopathic (endogenous) mental 
deficiency, who manifested signs of psychosis prior to or during testing, or who had - 
a history of schizophrenia or manic-depressive psychoses. 

The organic group consisted of 60 subjects with positive medical’ and /or 
anamnestic evidence of irreversible brain damage, or, in cases where such were lack- 
ing, corroberative psychiatric and psychological diagnoses of brain damage (based 
on mental status exams and psychodiagnostic evaluations, respectively). The mean 
educational level for this group was ninth grade, and the mean age was 41.8 years. 
The diagnostic composition was as follows: (chronic brain syndrome due to) al- 
coholism 22, cerebral vascular accidents 9, unknown or unspecified causes 9, brain 
—. 1 multiple sclerosis 5, convulsive disorder 4, post-encephalitis 2, and CNS 
syp 

The control group consisted of 60 subjects without anamnestic, medical, psychi- 
atric or psychological evidence of brain damage. The diagnostic composition was as 
follows: sociopathic personality disorder 27, hospitalized normals with physical ill- 
ness or disability 14, personality trait and pattern disturbance 14, and psycho- 
neurosis 5. The mean educational level was ninth grade, and the mean age was 39.2 


A third group, referred to as questionable, consisted of 18 subjects without 
anamnestic or medical evidence of brain damage but with conflicting psychiatric and 


*A paper based on this study was nted at the annual meeting of psychologists of the State 
of Ohio, Division of Mental ine, olumbus, May 27, 1960. a 
_ ‘Appreciation is expressed to the medical staff of Drake Memorial Hospital and County Home, 
Cincinnati, for their cooperation in securing some of the subjects for this eh , 
*Medical evidence of brain damage refers to neurological signs, cranial a laboratory tests 
such as spinal fluid analysis, E. E. G., ete., and other information which enabled the physician to make 
a relatively definite diagnosis of chronic brain syndrome. 
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psychological diagnostic impressions. This group was not controlled for the pre- 
viously mentioned variables and was used to make comparisons only on the basis 
that — scores might be expected to fall between those of the organics and the 
controls. 


Administration. The book form of the CPM was used. Ss were tested in- 
dividually, and their responses were recorded by the examiner. The instructions used 
were those in the manual for the regular Progressive Matrices®), as they are the 
instructions for adults and those in the CPM manual are for children. The wording 
was changed slightly in order to “Americanize” the terminology, i.e. correct was 
substituted for quite right, etc. Group administration is possible but it not recom- 
mended for clinical purposes. 


Scoring. Regular scores were computed but were not utilized in this study. A 
form of scatter analysis was used to differentiate the protocols of organics from those 
of controls. This was based on the rationale that persons without demonstrable 
impairment of intellectual functioning show a typical scatter pattern consisting of 
errors on the last items of sets A and Ab, and on several of the last six items on set B. 
This pattern has proven to be so consistent for adults (at least those individually » 
tested) that any remarkable deviation in the direction of more errors is almost 
certain to be indicative of pathological functioning. A special method of scoring 
based on scatter interpretation was named the scatter unit score defined as two 
errors on items 1 through 11 of set A, or three errors on items 1 through 11 of set 
Ab, or two errors on items 1 through 6 of set B. Errors in excess of one unit, but 
less than two units on any set were scored as .5 unit. Thus it is possible to have 
errors comprising from 1 to 5 units on set A, from 1 to 3.5 units on set Ab, and from 
1 to 3 units on set B. The sum of units on all three sets constitutes the scatter unit 
score. 


AND Discussion 


Establishing an optimal cutoff score at one or more units, the scatter unit 
system correctly identified 82% of the organics and 92% of the controls. These 
results are significant beyond the .001 level of confidence (median test). Exactly 
50% of the questionable group were above or below the cutoff point, dividing the 
group evenly into organic and control classifications. 


Tas.e 1. Distrisution oF Scatrer Unit Scores 


Scores Organic Control Questionable Classification 
Group Group Group 


ll 55 Nonorganic 


1 


on oO 


60 


The results suggest that the CPM, in addition to its original use of providing 
a nonverbal measure of intelligence, may be a valuable instrument for the detection 
of organic brain dysfunction when conditions of psychosis and idiopathic mental 
deficiency can be excluded. The chief value of the scatter unit scoring system ap- 


0 
1 
2 
1 
= 18 
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pears to be in the affirmation of organicity in cases where scores are beyond the 
cutoff point, since a very small percentage of controls scored within organic limits. 
The sensitivity of the test, for this purpose, can be increased by raising the cutoff 
score. Conversely, scores below the cutoff point are not as strongly indicative of the 
absence of brain damage because proportionately more organics fall within control 
limits; thus it would not be practicable to lower the cutoff score in an attempt to 
gain greater sensitivity to organicity. 

The 50-50 classification of the questionables is interpreted as a substantiation 
of the efficacy of the diagnostic groupings, since the presence of brain damage in 
this group was not agreed upon. The group would be expected to form a borderline 
category, falling between the two clearly defined diagnostic groups in scoring, and 
such was the case. 

Some remarks seem worthy concerning the nature of the problems that are 
most useful in discriminating between the organics and controls. The items that 
were missed by a significantly greater number of organics were (in descending order 
of significance as determined by x’): Ab 5, Ab 9, Ab 6, A 10, Ab 11, Ab7, Ab8, A 11, 
Ab 4, B 6, A7, B 4, Ab 10, A8, A 9, B 5, Ab 2, Ab 3, B 3. These are mostly con- 
figurational and gestalt patterns requiring symmetrical completion and a perception 
of closure. In the light of current findings and theories“) about the effects of brain 
damage upon mental processes, it would be anticipated that this type of problem 
could present difficulty in solution by brain damaged persons. The number of cases 
of clearly defined localized damage were not sufficient to permit an analysis of 
po og the CPM is more sensitive to damage in certain areas of the brain than 
others. 

A factor not controlled in the study was total time required to complete the 
test. This may prove to be important because some organics who obtained high 
scores were observed to be exceptionally slow, requiring 15 or 20 minutes for com- 
pletion. The average time for normals is cateenan at about eight minutes. 

Another important observation is that institutionalized adolescents tend to 
show a relatively high atypical scatter, thus reducing the usefulness of the test for 
that age group. 


SUMMARY 


The CPM protocols of 60 organic Ss were compared with those of 60 non- 
organic controls. Equal proportions in each group were selected from a mental 
hospital and from medical patients in a long-term hospital. Ss with functional 
psychoses and idiopathic mental deficiency were excluded, and the groups were 
controlled for factors of age, education, sex and race. A special scoring system was 
devised that correctly identified 82% of the organics and- 92% of the controls. The 
CPM shows promise of being a valuable instrument in the detection of brain damage, 
but the results of additional studies should be awaited before this technique is 
applied to general clinical use. 
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INTELLIGENCE AND VERBAL KNOWLEDGE IN RELATION 
TO EPSTEIN’S OVERINCLUSION TEST* 
MAHESH M. DESAI 
Belmont Hospital, Sutton, England 


PROBLEM 


Cameron “-*) has regarded overinclusion as one of the characteristics of schizo- 
phrenic disorganization. Epstein®) devised a test to measure the degree of over- 
inclusion in concept-formation comprising 50 key words, each followed by 6 response 
words (including “‘none’’). The subject has to underline for each stimulus word all 
response words which designate things or concepts required for the complete thin: 
described by the key word. Five of the 50 key words are neologisms. Epstein foun 
a significant difference in the expected direction in the overinclusion scores of 38 
schizophrenics as compared with the scores of 45 normals matched for age, sex and 
vocabulary as measured by the Shipley-Hartford Retreat scale. He also found that 
the Abstraction score on this last test did not correlate significantly with overin- 
clusion. Payne‘ used the Epstein test on 18 schizophrenics and 16 neurotics 
matched on the Mill Hill Vocabulary Scale. The schizophrenics had a very much 
higher mean. However, in a later study Payne? compared 20 schizophrenics with 
an equal number of neurotics, endogenous depressions and normals respectively who 
were matched for age, education, vocabulary and occupation. They were also given 
an intelligence test. The Epstein test did not differentiate well and the overinclusion 
score had a “very high loading (about .7) on a clearly defined factor of general in- 
telligence’’. 

In evaluating the practical usefulness of the Epstein test, therefore, it is neces- 
sary to consider the effects of intelligence and verbal knowledge on the overinclusion 
scores in psychiatric populations, and this was the main purpose of this study. A 
secondary aim was to derive norms for overinclusion for a primarily neurotic popu- 
lation. These scores are also compared with the scores of schizpohrenics as obtained 
in other studies. 

PROCEDURE 

Raven’s Progressive Matrices“, Mill Hill Vocabulary (Synonyms) Test), 
and Epstein’s Over-inclusion Test' were given to 217 male psychiatric cases at 
Belmont Hospital. It was thought that the order in which the M. H. V. and the 
O-I tests were given might influence the O-I scores, since the M. H. V. requires the 
subject to choose one, and only one, word out of six for each key word. If these in- 
structions were carried over to the O-I test to any extent, they might tend to limit 
the O-I score and to increase the neologism score. To control this possibility, the 
M. H. V. was given before O-I in 137 cases and the order was reversed in 80 cases. 
Those patients who showed evidence of psychosis or intracranial pathology were 
eliminated. Also patients who gave an unreliable result in the Matrices, as assessed 
in terms of a deviation of a total of five or more points from the expected scatter in 
the five sets of the test, were eliminated. This selection left 80 subjects in the first 
group and 40 in the second group. The later were matched for age, intelligence and 
vocabulary levels with 40 subjects from the first group. As there was no significant 
difference in their O-I scores, the two groups were pooled to give a total of 120 cases. 

The distribution of psychiatric diagnosis based on the most prominent symp- 
toms, was as follows: Anxiety 35, Hysteria 9, Obsessional Neurosis 4, Neurotic De- 
pression 12, Neurotic or Psychopathic Personality Disorder 57 and Psychosomatic 
illness 3. The mean age was 34.71 years (S. D., 8.76). The mean score on Matrices, 
corrected for age, was 51.88 (S. D., 5.18) and on M. H. V. 29.25 (S. D., 5.77). 


*The writer is grateful to Dr. David Shaw, the Medical Administrator and the psychiatric staff of 
Belmont Hospital for permission to use their — as subjects and to Dr. Robertson and Mr. 
Senn Ay of Netherne Hospital for supplying ata on a iy ere population. The writer is also 
Ses ul to a Roy Secker, Senior Technical Assistant to the Psychology Department, for his help 
in the research. 


‘Progressive Matrices will be referred to as Matrices, Mill Hill Vocabulary as M. H. V. and Over- 
inclusion as O-I. 
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RESULTS AND Discussion 


The Pearson r between the Matrices scores (corrected for age) and the O-I 
scores was —.350. For 118 degrees of freedom this is significant well beyond the .01 
level. Thus there seems to be some tendency for O-I scores to vary inversely with 
intelligence level as assessed by the Matrices test. The mean Matrices score (cor- 
rected for age) is 51.88, with 8. D. only 5.18, which makes the level of our sample 
considerably higher than that of the standardization sample. If we had a sample 
with more normally distributed scores on the test, the r might have been higher. 
This prediction needs to be tested. 

The Pearson r between the M. H. V. and O-I scores was —.449, also significant 
well beyond .01 level. Thus, O-I scores tend to vary inversely with verbal knowledge 
as assessed by the M. H. V. 

It is not surprising that intelligence level should be related to O-I scores. One 
of the factors involved in doing the O-I test is presumably the ability to distinguish 
between essential and unessential relationships, and the educing of relations and 
correlates has been regarded by Spearman“) as basic processes in intellectual 
functions. Indeed some intelligence test batteries use verbal or non-verbal tests 
which are very similar to the O-I test. 

In view of the fact that intelligence and verbal knowledge are correlated (in our 
sample the r between Matrices and M. H. V. was only 0.249, but significant), the 
influence of intelligence was partialled out. There was still a correlation of —0.339 
between M. H. V. and O-I scores, which is significant beyond .01 level. 

The mean O-I score of our population was 19.80 (S. D. 18.46). A comparison of 
this with the scores in other studies can be of only limited value in view of the fact 
that the population data are not strictly comparable. However, a comparison may 
be of some interest in assessing the possible use of the Epstein Test for purposes of 
differential diagnosis. As compared with our mean O-I score of 19.80, Epstein’s 38 
schizophrenics had a mean of 20.92. In Payne’s study of 18 schizophrenics the mean 
was 46.9 (S. D., 51.0) and in an unpublished study by Philip Ley it was 35.88 (S. 
D., 30.14). Thus there are wide variations between the scores of schizophrenic popu- 
lations, implying differences of sampling. From his own population Epstein suggested 
a cut-off O-I score of 18 as indicative of pathological overinclusion. Such a cut-off 
score cannot be used in the discrimination of schizophrenic illness from neurosis or 
psychopathy since our population has a mean O-I score almost as high as that of 
Epstein’s schizophrenics. 

A similar conclusion emerges when Ley’s data are analysed in terms of the 
scores of 28 schizophrenics from the admission ward and an equal number from 
chronic back wards. Seventeen of the former and only 7 of the latter had a score of 
20 and under. The value of 20 was chosen for the dichotomy as it is nearest to the 
mean of our non-psychotic population. A x? value of 7.291 was found, significant at — 
01 level. Further, Payne and Hirst‘) comparing 11 depressives with 14 normals 
found that the depressives had a mean O-I score of 37.91 (S. D. 22.87). This is of 
course much higher than Epstein’s mean for schizophrenics. These figures indicate 
caution in the use of the Epstein test for differential diagnostic purposes. 


SUMMARY 

Epstein’s Over-inclusion Test, the Progressive Matrices Test and the Mill Hill 
Vocabulary (Synonyms) Test were given to 120 psychiatric cases with a diagnosis 
primarily of neurosis or personality disorder. The Pearson rs between intelligence 
level as assessed by Progressive Matrices, verbal knowledge as assessed by Mill Hill 
Vocabulary, and Overinclusion scores on the Epstein Test were determined. The 
relationships of intelligence and verbal knowledge to Overinclusion scores were found 
to be negative and significant. 

The Overinclusion scores of our population were discussed in relation to those of 
schizophrenic populations. The mean score of our non-psychotic sample was found 
to be higher than Epstein’s cut-off score for normals and schizophrenics, and at 


= 
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least as high or higher than the scores of 61% of a sample of Admission Ward schizo- 


419 


phrenics. The study points to the need for caution in the use of the Epstein Test for 
differential diagnosis. The influence of intelligence, of verbal knowledge and of 
neurotic or character disorders, need to be borne in mind in the interpretation of the 


Epstein test scores. 
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Efficient clinical work in institutions with 
a patient populations requires the availability 
of tests whi & on be briefly administered and 
ey reted. This study investigated the use of 

ipley-Hartford Retreat Scale“) as a means 
> estimating intelligence. Survey of the litera- 
ture for 1950-1958 indicates a paucity of studies 
on this instrument. 


MeETHOD 


Scores for all Ss who had taken both the 
Wechsler Adult Intelligence Scale (WAIS) and 
the Shipley-Hartford Scale (SH) since 1955 were 
obtained tom the records of patients at the 
Veterans Administration Neuro ropsychiatric Hos- 
pital in Palo Alto, California. No other criterion 
was employed. A total of 29 Ss were found who 
met this criterion. The SH Mental Age scores 
were converted into IQ estimates by dividing 
each score by 16. 

REsULTS 

The mean SH IQ score (104.3, 8. D. 16.5) does 
not differ significantly from the mean WAIS Full 
Scale IQ score (101.8, S. D. 16.0). Further evi- 
dence of the closeness of the association between 
SH and WAIS estimates was obtained by com- 


1The author is trap to Quinn McNemar for 
his help in critic evaluating this manuscript. 
This study was cmpinel at Stanford University. 


ting Pearson product-moment correlations 
| sre I SH IQ scores and the WAIS Full Scale 
IQ (FS), the WAIS Verbal IQ (VS), and the 
WAIS Performance IQ (PS) scores. Correlations 
were as follows: SH-FS .76, SH-VS .83, SH-PS 
.58, all significant at less then the .001 level or 
better, e odd-even reliability coeffici-nt cor- 
rected by the Spearman-Brown formula was .84. 


SuMMARY 


This study investigated the use of the Shipley- 
Hartford Retreat Scale as a brief estimate of in- 
telligence. Ss were all 29 patients who had taken 
both the Shipley-Hartford Scale and the Wech- 
sler Adult Intelligence Scale at the Palo Alto 
bp Administration Neuropsychiatric Hos- 
a within the period 1955-1959. Correlations 

tween the Shipley-Hartford and Wechsler 
estimates of intelligence were significant at the 
.001 level or better. It was concluded that the 
Shipley-Hartford could be useful as a screening 
measure of intelligence with neuropsychiatric 
patients. 
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AN APPLICATION OF THE METHOD OF EXTREME DEVIATIONS TO 
THE WECHSLER INTELLIGENCE SCALE FOR CHILDREN 


BARRY P. FROST 


Child Guidance Training Centre, London, Great Britain 


PROBLEM 

As Bradway and Benson point out with 
respect to the Wechsler-Bellevue Scale“), the 
use of the method of comparison of sub-test 
mean scores in various diagnostic categories has 
not produced information of much value to the 
clinician. The method of extreme deviations, 
introduced by these authors, seems likely to be a 
more useful device for differential diagnosis. 

The pa of the present study first is to 
calculate what might reasonably be called a hi, 
or low score (extreme deviation) on each of the 
sub-tests of the WISC, and second to see if these 
extreme deviations are associated with any or 
several diagnostic categories. The major ques- 
tions asked are: Do we find certain extreme 
WISC sub-test deviations invariably associated 
with a particular diagnostic category? Do we find 
extreme deviations which are never found with a 
particular diagnostic category? Do we find ex- 
tremes whose associations with one diagnosis 
rather than any other are statistically significant? 


METHOD 


Our data consist of the WISC results of the 107 
children who happen to have been given this 
test in full (at least 9 sub-tests) rather than any 
other intelligence test, in this clinic during the 
five years prior to January 1959. The age range 
of the Ss was from five to fifteen years. WISC 
1Qs ran from 59 to 132 (Full Scale), mean 
99.0 and 8. D. 15.3. The distribution varies 
slightly from the expected but not sufficiently 
to be statistically significant (0.10 < p < 0.20). 
There are 79 boys and 28 girls in the sample 
(74% and 26% respectively), about the usual 

roportion found in child guidance referral 
However, this proportion is not main- 
tained within the various diagnostic categories. 

The children are placed in ten diagnostic 
categories on the basis of the psychiatric descrip- 
tion. They are “Withdrawn’’, ‘Immature’, 
“‘Anxious-H yperactive’’, “‘Depressed-Passive-De- 
pendent’, ‘‘Psychopathic-Affectionless”’, ‘Ob- 
sessional’, ‘‘Severely Disturbed”, Reactive Be- 
havior’, ‘Repressive”’, and “Others”. The 
categories were arbitrarily made mutually ex- 
clusive; where there was any overlapping of 
descriptive categories, that which appeared to be 
the major category was chosen as the criterion. 
The category ‘“Others’’ consists of those cases in 
which, for any descriptive category, N < 5. An 
exception was made in the case of “Severely 
Disturbed’. 

To determine what should be considered an 
extreme deviation, sub-scores were averaged for 
each individual separately on both the Verbal 
and Performance es. Each sub-score was 
then taken as a deviation from its respective 
(Verbal or Performance) Scale Mean. There was 
no grand mean, the scores of all the subjects were 
never aver: for any sub-test. Thus from 79 
to 107 deviations for each of the 11 sub-tests were 
obtained (not all subjects did all the sub-tests). 
These were placed in frequency distributions and 
extreme deviations were selected as those within 
the first or last 20th percentile of these distribu- 


tions. Thus an extreme deviation was defined as 
one which not more than 20% of the sample 
reached. The lower limit was defined by counting 
in 20% of the cases from both ends of the distri- 
bution and noting the deviation score there. It 
will be seen that the numerical value of the ex- 
treme deviation differs from sub-test to sub-test. 

Two by two contingency tables for all subjects 
in each diagnostic category versus all other sub- 
jects and for presence of the deviation against its 
absence were prepared and Chi-Squares with 
Yates Correction computed ®), 


REsULTS 
None of the extreme deviations were associated 
exclusively with any one diagnostic category. 
Each extreme deviation is represented in almost 
all diagnostic categories; where there are zeros 
in the cells, they are not significant due to the 
small N in these categories. The only one which 
approaches significance is Vocabulary Plus in the 
“Anxious-Hyperactive’ group (0.10 > p > 
0.05). Hence no extreme deviation excludes any 
category completely. 
owever, some statistically significant find- 
ings do appear. Block Design Plus and Picture 
Completion Minus appear frequently enough in 
the “Obsessional” group to be significant at the 
0.05 levels. The ‘Psychopathic-Affectionless’’ 
group is associated with Arithmetic Plus and 
icture Completion Plus, both at the 0.05 level. 
Arithmetic Plus and Comprehension Minus are 
found in the “Immature” group at the 0.01 and 
0.02 levels of significance. Vocabulary Plus in 
the ‘“Repressive”’ and Block Design Plus in the 
‘Severely Disturbed’’ groups are both frequent 
enough to be at the 0.05 level. Lastly, Compre- 
hension Plus is associated with the ‘“Anxious- 
Hyperactives’’ at the 0.02 level of significance. 


SUMMARY 


It has been suggested that certain combinations 
of high and low sub-test scores on the Wechsler- 
Bellevue Scale are diagnostic at ‘“‘psychiatric’’ 
levels. The present investigation examined 107 
WISC records from a child guidance clinic popu- 
lation by the method of extreme deviation in 
order to determine = —_ of high and low 
scores to specific clini diagnostic patterns. 
Deviation of each sub-test score from the scale 
mean for each individual record was calculated 
and the child placed in a diagnostic category on 
the basis of psychiatric description. Results in- 
dicated that none of the extreme deviations was 
specific to any one psychiatric category or dis- 
criminated between categories. Some statistic- 
ally significant results are described. 
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ProBLEM AND METHOD 
This study investigates the possibility that 
the Verbal Scale of the WAIS®? can be abbre- 
viated to effect a considerable saving of time with- 
out sacrifice of either ages on or the ar of 


fun riginal instrument. 
The WA Verbal Seale reo Seale records of 100 con- 


oisiadte patients 16 to 72 at the Middle- 
town State Hospital were taken from the files and 
rescored using selected items for each subtest ac- 
cording to the scheme in Table 1. Ev 
item was used on the Information and Vocabu- 
lary subtests; only odd items were used on Com- 
ehension, Ari tic and Similarities. The 
it Span was not abbreviated. For each sub- 
test the total raw score was then multiplied by 
the appropriate factor as indicated in Table 1 
and the results converted to weighted scores in 
traditional manner. Wechsler’s tables were then 
utilized as usual in obtaining the abbreviated 
Verbal IQs. A plication of the ab- 
breviated scale showed t it tended to exag- 
gerate IQs over 90. For this reason all IQs of 90 
or over on the abbreviated scale were corrected 
ry subtracting 2 points from the obtained Verbal 


1. Desicn or ABBREVIATED SCALE 


Multiply 
Score by 


Subtest Items used 


Information Every third item 3 
Comprehension Odd items only 2 

it Span Unchanged 1 
Arithmetic Odd items only 2 
Similarities Odd items only 2 
Vocabulary Every third item 3 

REsuLTs 
oaks ts the Verbal IQ scores on the 


and what the subjects 

ba have obtained had they been 
abbreviated form and ey, the same re- 
he scores in Table 2 in- 
ide the correction of 2 points for IQs over 90 as 
noted above. The mean deviation between orig- 
inal IQs and those obtained on the abbreviated 
Verbal Scale was 2.7 points of IQ. 71% of the 


scores on the abbreviated form came wi plus 


AN ABBREVIATED FORM OF THE WAIS VERBAL SCALE 


WILLIAM WOLFSON AND LEONARD BACHELIS 
Middletown State Hospital 


TaBLE 2. ORIGINAL AND ABBREVIATED VERBAL IQs 


or minus 3 points of the original IQ; 85% fell 
within 4 points. The largest deviation observed 
was one case scoring 7 points higher on the abbre- 


viated form. 
Discussion 


Inspection of Table 2 reveals that for all Verbal 
IQ ranges the correlations between the short and 
original forms were consistently high and that 
these correlations were statistically significant. 
This suggests that the abbreviated version may 
have considerable merit in research where it is 
essential to obtain the verbal intelligence of large 
groups and time is a factor. The mean time for 
administering and scoring the abbreviated verbal 
scale given to 20 randomly selected hospital 
patients was 15.8 minutes. 

Most previous attempts at abbreviating the 
Wechsler scales have concentrated on using 
selected subtests ®: 42-4), Such abridgments 
have the disadvantage that the variety of mental 
processes sampled is necessarily limited. The 
present sthed however taps whole variety 
of functions contained in the original verbal scale 
and therefore would seem to be useful in the clini- 
cal situation. The obtained correlation of .97 for 
the short form with the original form compares 
very favorably with Wechsler’s reliability co- 
efficient of .96. The standard deviation of the 
differences in the two IQs of 2.9 compares equally 
favorably with Wechsler’s standard error of 
measurement of 3.0. 


SuMMARY 


By using selected items on each of the sub- 
tests of the Verbal Scale of the WAIS it was 
shown that a considerable saving of time could 
be had while yet retaining the reliability and the 
variety of functions of the original verbal scale. 
The legitimacy of using the abbreviated form for 
either research or individual clinical examination 
was 4 
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Original Form 
Mean 8.D. 


Short Form 
M 


Original IQ 

55-74 19 66.6 5.8 68.7 5.3 .83 .001 
75-89 32 82.6 4.1 83.0 4.7 .74 
90-109 36 97.6 6.4 98.0 .87 .001 
110-128 13 116.5 4.1 116.0 5.5 .80 .001 
55-127 (tot) 100 89.4 16.0 90.0 15.6 .97 .001 


RORSCHACH PRODUCTIVITY AND PARTICIPATION 
IN GROUP PSYCHOTHERAPY! 


GORDON E. RADER 
University of North Carolina 


PROBLEM 


A number of writers on the Rorschach have pointed out the interpersonal 
aspects of taking a psychological test and drawn parallels between the Rorschach test 
situation and other clinical situations. Zubin’, in fact, refers to the Rorschach as a 
form of standard interview under the veil of ink blots. He feels that a patient will 
manifest many of the same characteristics and behavior tendencies while taking the 
Rorschach that he does while being interviewed. It might seem, then, that the Ror- 
schach could be useful in predicting a patient’s productivity in a group therapy situa- 
tion, knowledge of which can often be very helpful in selecting the right patient for 
the right group. The present study investigated the relationship between productiv- 
ity while taking the Rorschach and productivity in group psychotherapy. 


MeEtTHOD 


Subjects. The subjects for the study consisted of 45 state prison inmates divided 
among six discussion groups meeting weekly at the prison. An effort had been made 
to rule out mental defectives and psychotics and to choose ‘“‘acceptable’’ candidates 
for psychotherapy. 

Four of the groups were conducted jointly by two psychiatric residents. The 
remaining two groups were conducted by a psychologist with the author as observer. 
The psychiatric residents were quite active in the conduct of their groups while the 
psychologist intentionally assumed a passive, laissez-faire policy with regard to his 
two groups. Because of the unknown effects of these two methods of handling the 
groups, the results will be handled separately for the psychiatrists’ four groups 
(Sample A, N = 26), and the psychologists’ two groups (Sample B, N = 19). 


Rorschach data. Rorschachs were all administered by the experimenter, before 
or during the first month of the group discussions. With the exception of four sub- 
jects who were observed once, all subjects were tested before being observed in the 
groups by the experimenter. The administration approximated the procedure sug- 
gested by Sarason“. If the subject stopped after giving only one response to the 
first card he was encouraged to see more. After that no further encouragement was 
given. Any questions about number of responses desired were answered with ‘‘what- 
ever you wish.” 

Three measures of productivity on the Rorschach are employed in this study: 
Number of responses, self-expression divided by number of responses (S-E/R), and 
emotionalization divided by number of responses (Em /R). The latter two measures 
are derived from a previous study by the author“ and a more extensive explanation 
can be found therein. Each response to the Rorschach was separately rated for Self- 
expression and Emotionalization. Self-expression represents the degree of a personal, 
subjective element in the response and is based upon a combination of such factors 
as the normative frequency, form level, specificity and emotional intensity of the 
response. Emotionalization is a rating of the emotional intensity of the response and 
is one of the factors contributing to the Self-expression rating. Dividing the sum of 
the ratings within a record by number of responses provides the average Self-ex- 
pression and Emotionalization per response. Rorschach protocols with relatively 


1This investigation was oy te: in part by research grant FRF 1957-40 granted by the Uni- 
niversit 
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high S-E/R and Em /R would be generally considered rich, subjectively determined, 
and more reflective of inner drives or personality factors. A protocol with low 
S-E /R and Em /R would be characterized as banal, non-committal, stereotyped, and 
less reflective of the inner life of the subject. 


Therapy participation data. The extent of participation of each subject in his 
particular group was rated independently by both therapist and observer following 
each session. The following six-point scale was used: 

No participation—did not speak at all 

Minimal participation—did less than 5% of the speaking. 
Occasional participation—did 5-10% of the speaking. 

Moderate participation—did 10-20% of the speaking. 

Active participation—accounted for 20-40% of the speaking. 
Dominated group—accounted for more than 40% of the speaking. 


The particular scale was not originally constructed to reflect any of the many quali- 
tative features of participation, such as the significance, emotional intensity or self- 
revelation of the communication.? However, since the ratings were made by memory 
immediately following the session it is likely that error was in the direction of giving 
increased weight to the more significant communications since these would have left 
a greater impression upon the raters. 

For various reasons (conflicting assignments, parole, discharge, transfer, pun- 
ishment, etc.) the number of sessions rated differs considerably among the sub- 
jects. The range was from 3 to 15 in Sample A, median 13.5. Ouly 3 of the subjects 
attended less than six sessions. In Sample B, the range was 3 to 13 sessions, median 
10. Only two subjects attended less than seven sessions. 


RESULTS 


None of the six correlation coefficients computed even approached significance 
(Table 1). No relationships were found between measures of productivity on the 
: Rorschach and a measure of verbal productivity in group therapy sessions. 


1. CorRELATIONS BETWEEN INDICES OF RorscHacu PRODUCTIVITY AND 
RatInGs OF PARTICIPATION IN GRouP PSYCHOTHERAPY 


Sample A Sample B 
Rorschach Indices (N = 26) (N = 19) 


Responses (R) 
Self-expression mses (S-E/R) 
Emotionalization / nses (Em/R) 


The absence of relationships between these measures cannot be attributed to 
unreliability. The interscorer reliability for the three Rorschach measures was .99, 
.86, and .89 in Sample A for R, S-E/R and Em /R respectively, and .99, .47, and .81 
for the same variables in Sample B. Interrater reliability of the group therapy parti- 
cipation ratings, computed session-by-session, was .84 in Sample A and .86 in 
Sample B. 

SuMMARY AND CONCLUSIONS 


The present study failed to find any relationships between three measures of 
Rorschach productivity and productivity in group psychotherapy. These results 
raise grave doubts about the common assumption that the Rorschach test is simply 
a standardized means of sampling behavior and that such behavior is likely to be 


*An attempt was made to rate several of these factors from the extensive therapy notes in Sample 


B. All ratings were highly intercorrelated and seemed to represent primarily the amount of attention 


the icular inmate received in the therapy note. None m | these ratings was significantly related to 
the measures, 
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representative of behavior in analogous ordinary life situations ‘** *e 2, p. 3). On 
the basis of his own experimental results, Gruen“) has cautioned against making 
simple inferences of direct relationships between Rorschach behavior and life be- 
havior. The present results add further weight to this caution. 
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RORSCHACH CARD REJECTION AND IQ 
T. G. CROOKES AND ANNA J. KELLER 
St. John’s Hospital, Aylesbury, England 


Tamkin’s“ report of the negative correlation between IQ and number of 
Rorschach cards rejected by male patients prompted us to repeat the work on a 
group of female patients whose Rorschach records been tabulated for a different 
purpose. As we had a larger group, we were also enabled to consider whether the 
relationship varied for different cards, some cards perhaps being more likely to be 
rejected for emotional reasons, others being more ‘difficult’ from a strictly intel- 
lectual point of view. 

A group of children’s records was also examined to see if there was any differ- 
ence in the relationship there. We were unable to make a prediction of the direction 
of the difference (if any), because of the difficulty of understanding in what way the 
intellectual factor operates. It is clearly not just a matter of intellectual ability, 
since some defectives give records without rejections and people of very high IQ 
sometimes reject. We considered two ways in which the factor might operate: (a) 
On a strictly intellectual basis, greater ability enables S to make more of the blot, to 
organize it better, but against this the 8 of higher IQ has higher standards of what 
constitutes an acceptable response. These tend to cancel each other out, but if 
Tamkin is right, the first one predominates in adults, so that the more intelligent 
reject less often. (b) Supposing there is an emotional factor in rejection, there might 
be a tendency to reject a card because what it suggests is unpleasant or embarrass- 
ing. In this situation, the S of higher IQ might be more aware of the possible sig- 
nificance of rejection, hence more likely to try to find an alternative response; also 
he would have less rigidity and more ingenuity to enable him to find such an alterna- 
tive. In case (a) children might show the same relationship between IQ and re- 
jection as adults, if the balance between ability and standards is more or less the 
same at different ages. In case (b), the child’s greater straightforwardness and 
naiveté in the situation might make the rejection a more direct indicator of emo- 
tional significance in the cards, and intelligence might have less effect. 


METHOD 


The adult group consisted of 293 ee peepee outpatients of St. John’s 
Hospital, aged 16 to 69. They had been as a routine investigation; they all 


pray ech., 19974, Z1, 294-306. 
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did the test in similar out-patient conditions and with the same examiner (T. G. C.). 
The IQ in all cases is the Wechsler-Bellevue, Form I. All records are included in 
which both Rorschach and IQ were done. The children had also been seen in the 
same psychiatric out-patient department, and the tests were carried out in similar 
conditions by the same examiner. Again, all cases with the requisite data are in- 
cluded. There are 217 children (134 boys, 83 girls) aged 5 to 15 inclusive, (but there 
are only 7 under the age of 7). The IQs here are derived from three tests, the Wech- 
sler-Bellevue, Form I (N = 126), the W. 1.8. C. (N = 26), and the Revised Stanford- 
Binet (N = 65). The good correlation among these tests“) makes it reasonable to 
mix the types of 1Q for this purpose. 


RESULTS 


Table 1 shows the adult group divided according to the number of Rorschach 
cards rejected, with the mean age and IQ of each sub-group. Tamkin’s correspond- 
ing means are shown in brackets. 


1. Mean and IQs or Ss Resectina DirFERENT NUMBERS OF 


Cards Rejected Mean IQ 


0 169 33.5 °110.3 (107.6) 
1 58 32.0 105.1 (107.1 
2 30 32.4 101.2 (102.0 
3 16 37.8 94.1 (101.2 
4 or more 20 30.3 95.1 89.8) 


The mean IQ decreases steadily with the number of cards rejected, except for a 
slight reversal in the last group, due mainly to one S with 1Q 132. The mean of the 
124 Ss who rejected one or more cards is 101.1; comparing this with the mean of 
those who did not reject at all gives ¢ = 5.05, p < .001 (two-tailed test, as are all the 
significance tests quoted). Other comparisons are: No rejections versus one re- 
jection, t = 2.48, p < .02; one rejection versus 2 or more rejections, ¢ = 3.20, 
p < .01. The mean ages were calculated, as it was thought that there might be an 
increase of rejections with increasing age. However, it will be seen that the ages of 
the five groups vary little and show no definite trend. 

To compare the 1Qs of Ss rejecting different cards, the purest way is to consider 
the instances where one card only is rejected. Only 2 cards occur often enough as 
single rejections to make a large enough group for comparison—cards IX and VI, 
which each occur 15 times. These two cards are suited to our purpose, since card IX 
is thought to be intellectually difficult to organize into a good response“), and card 
VI has some of the most obvious sexual content. If card VI tends to be rejected for 
emotional reasons and card 1X for intellectual reasons, Ss who reject VI might be 
expected to have a higher 1Q than those who reject IX. The mean IQ of the 15 Ss 
rejecting IX alone is 104.2, the mean of the 15 who rejected VI alone is 102.7. The 
difference is insignificant and in the wrong direction. To extend this method, the 4 
cards were selected from Klopfer’s®) content analysis which are said to have areas 
commonly seen as sexual, cards II, IV, VI and VII. The IQ of Ss who rejected one 
or more of these cards but none of the other six was compared with that of those who 
rejected one or more of the other six but none of these four. There are 41 Ss in the 
“sexual” group, mean IQ 103.6; the other group contains 28 Ss, mean IQ 105.1. 
Again, the difference is slight and in the wrong direction. 

The mean IQ of children who reject no cards is 113.5 (N = 122) of those who 
reject one or more 110.0 (N = 95). The difference is not significant. However, the 
results are more interesting when they are divided into two age groups. They were 
split as near the mid-point as possible while keeping years intact. Table 2 shows the 
mean IQ of the two age groups, boys and girls separately. Groups showing a sig- 
nificant difference between those who reject and those who do not are Girls under 


_ (Aputts) 
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2. Maan IQs or CuILDREN wHo Resect or Do Nort Rasxct, By AGE 
Group Spx 


No Rejections 1 or more Rejection 


12 (t = 2.28, p < .05), Boys 12-15 (¢ = 2.94, p < .01) and Total 12-15 (¢ = 3.34, 
p < .01). The older children show the same pattern as the adults, but with the 
younger ones, those who reject are, if anything, of higher IQ, than those who do not. 

The young children are of considerably higher IQ than the Adults, and it might 
be felt that the findings could be due to this bias. Table 3 shows the proportion of 
Ss Hy in the 3 age groups, showing three IQ groups separately, low, medium 
an 


TaBLeE 3. Proportion or Ss Resectinc THREE AcE AND IQ Groups 


Group IQ (< 91) » High I Q (> 119) 
N N Rejecting Rejecting NN Rejecting 


Adults 40 25 (62.5 } 188 82 (43.6 65 17 (26. oe} 
Ch.12-15 | 16 11 (68.8% 63 28 (44.4 27 


Ch. <12 4 2 63 25 44 


The older children are very similar to the adults in all IQ groups; the younger ones 
are similar to the adults in the medium IQ group but show a much greater proportion 
of rejections in the high IQ group (x? Adults versus Children under 12 = 5.49, 
p < .02). 


SUMMARY AND CONCLUSIONS 


A study of the relationship between 1Q and rejection of Rorschach cards using 
the records of 293 female psychiatric out-patients confirmed at a high level of sig- 
nificance Tamkin’s finding of a negative correlation between IQ and amount of re- 
jection. No differences of IQ were associated with rejection of different cards. A 
group of 106 children aged 12 to 15 showed the same relationship between IQ and 
rejection as the adults, but in a group of 111 children under 12 those of higher IQ 
were just as likely to reject as those of lower IQ, in fact, slightly more likely. This 
finding supports the view that the role of IQ in this connection is to enable § to 
cover up a tendency to reject for emotional reasons, young children being less dis- 
ingenuous in this respect. However, this is not the only possible explanation of the 
results. 
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Group N Mean IQ N Mean IQ 
Under age 12 
Boys 45 117.1 30 116.5 
Girls 17 109.0 19 117.7 
Total 62 114.9 49 117.0 
ys 36 113.8 23 103.0 
Girls 24 109.6 23 102.1 
Total 60 112.1 46 102.5 


THE ADAPTATION AND VALIDATION OF THE DRAW-A-PERSON 
TEST AS A MEASURE OF SELF CONCEPT 


RAYMOND F. BODWIN AND MAX BRUCK! 
Flint (Mich.) Child Guidance Clinic 


PROBLEM 
The present study reports an attempt to construct a quantified self concept 
scale of the Draw-A-Person Test (SCS-DAP)® It is hypothesized that the SCS- 
DAP is a valid measure of self concept defined as consisting of these elements: (a) 
self-confidence, (b) freedom to express appropriate feelings, (c) liking for one’s self, 
= satisfaction with one’s attainments, and (e) feeling of personal appreciation by 
others. 


METHOD 
An inspection of many drawings revealed 13 characteristics thought to reveal 
self concept which are listed and defined as follows: 
1. Shading. Light, dim subtle, and uncertain lines which furtively accent particular parts 
of the figure. Patterned or stylized shading. 


2. Reinforcement. Shading of the boundaries of clothing or the figure. Heavy dark lines 
or parts of the drawing emphasized through retracing over the same area. 


3. Erasures. Any attempt to alter or perfect all or part of the drawing through erasure. 

4. Detailin figure. Unessential features or details added to the figure or background. 

5. Sketchy lines. Parts of the body, particularly the outline defined by light, broken, 
blurred, vague, fuzzy lines. 


6. Transparency. Body of the figure completely transparent or inadequately clothed so 
that body parts ordinarily covered are shown. es 


7. Asymmetry. Imbalanced and lopsided arrangement of the body parts in respect to size, 


shape, or pele on the opposite sides of the center. 

8. Distortion. Any unnaturalness or irregularity in form. Any non-human aspects to 
figure drawn often displayed by size disproportion. 

9. Incompleteness. Figure not drawn complete, lacking in significant body parts or cloth- 


" 10. Mézed-age. Disparity in the physiological maturation of various body parts such as 
hasized i childish body. 


breasts emp’ in an otherwise 

11. Opposite identifications. Figure drawn is of the opposite sex of the subject or if of 
the same sex, opposite sex c' teristics are displayed. 

12. Primitiveness. Over-all figure is crudely and roughly drawn. Specific points are con- 
fusion of full and profile view of the head, mouth emphasis, trunk incomplete, caneion of the 
neck, and disorganized body representation. 

13. Immaturity. Drawing is marked by elaborate treatment of the mid-line such as Adam’s 
apple, tie, buttons, buckle, and fly on trousers. There is emphasis on mouth and or breasts. 


DAP’s from 60 subjects ranging from 10 to 17 years of age, equally divided for 
sex, were rated on a 5 point rating scale for each of these 13 characteristics. Table 1 
shows the rating scale employed for characteristic 1, shading. 


TaBLe 1. SHapING 


(0-20%) (21-40%) (41-60%) (61-80%)  (81-100%) 
5 4 3 2 1 


Markedly Markedly 
absent present 


Identical rating scales were used for the remaining 12 characteristics and the 
ratings for all 13 characteristics were summed for total score. The subjects were rank 
ordered by total scores on the rating scale and the upper and lower 27 per cent were 
identified. In order te determine those characteristics which best discriminated 


1Now at Michigan State University 
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mature from immature self concept, an item analysis of the 13 characteristics was 
accomplished by comparing the upper and lower 27 per cent for each characteristic. 
Those characteristics with the highest mean difference between the upper and lower 
groups were retained for the final form of the SCS-DAP. 

The validity of the SCS-DAP was tested by having a judge rate the 60 subjects 
on a 5 point rating scale for self concept (as defined above) on the basis of an in- 
dividual interview and without knowledge of the SCS-DAP. Pearson product- 
moment correlations were calculated to measure the degree of relationship between 
the judge’s ratings and the SCS-DAP scores for the 60 subjects. The standard error 
of this correlation was calculated to test the significance level. 


AND DiscussION 


Table 2 presents the item analysis of the original 13 characteristics of the SCS- 
DAP. In considering which characteristics should be retained or eliminated, a cut- 


Tasie 2. [rem ANALYSIS RESULTS OF THE ORIGINAL Form OF THE SCS-DAP 


Mean for Mean for 
Characteristics Upper 27% Lower 27% 


posite sex identification 
Sketchy lines 
Incompleteness 
Transparency 
Immaturity 
Primitiveness 
Reinforcement 


Difference 


off point of less then two points discrimination was arbitrarily chosen. In this way 
the four categories of shading, mixed age, asymmetry, and detail in figure were elim- 
inated in the final revision of the SCS-DAP. 

Table 3 presents the relationship between the judge’s rating scale of the self 
concept and the SCS-DAP. The magnitude of the correlation (.64) indicates that a 


3: Tue RELATIONSHIP BETWEEN THE JuDGE’s RatinG ScALE FoR SELF 
ConcEPT AND THE SCS-DAP (N = 60) 


Judge’s Rating Scale SCS-DAP 


Range 9-24 19-48 
Mean 18.1 34.7 
8. D. 6.77 7.96 
r .61* 
8. E.r 


*Significant at one per cent level of confidence 


positive and very significant relationship existed between the judge’s rating scale and 
the SCS-DAP scores supporting the hypothesis that the SCS-DAP is a valid measure 
of self concept. The standard error indicates (p < .01) that the obtained correlation 
is significant. It could be speculated that the narrower range of judge’s rating scale 
scores is due to the relatively small size of the population tested, or the homogeneous 
nature of the group, or both. However, the greater dispersion of the SCS-DAP 
scores would tend to rule out both speculations. Perhaps the narrow range of the 
judge’s rating scale scores is the result of an overly cautious approach on the part of 
the judge in rating either extremely mature or immature self concept. 


1 

== 
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SUMMARY 

Because of the importance of self concept in reflecting the total environment of 
the child, this study attempted to devise and validate a brief projective technique 
for the evaluation of self concept. The Draw-A-Person Test was selected for this 
purpose and a self concept scale was constructed. This test was administered to 60 
subjects and then submitted to an item analysis. The 60 subjects were also independ- 
ently rated for self concept by a judge after a psychiatric interview. The judge’s 
ratings and the self concept scale of the Draw-A-Person Test were correlated. The 


correlation, which was significant at .01 per cent level of confidence, supported the 
hypothesis. 
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AN INVESTIGATION ON THE MEANING OF HUMAN 
FIGURE DRAWING! 


DENNIS K. KAMANO 
Galesburg (Ill.) State Research Hospital 


The purpose of this study was to reassess the hypothesis underlying figure 
drawing interpretation that when a person responds to the request to draw a person 
he draws a picture of himself. To test the hypothesis that human figure drawings do 
represent the drawer’s perception of himself, the semantic differential technique 
devised by Osgood, et. al.“ was used. If this hypothesis was valid clinically, the 
human figure drawn by the person should be similar in meaning to his own self per- 
ception. That is, there should be a significant semantic similarity (smaller D values) 
between the rating of his figure drawing and his rating of himself. 


METHOD 


Test. Four concepts were chosen in order to compare the meanings or associa- 
tions assigned to each by the subjects (Ss) on the semantic differential. They were: 
the figure drawn by the Ss, My Ideal Self (IS), My Actual Self (AS), and My Least- 
Liked Self (LLS). Three different aspects of the Self were utilized to determine 
whether the Ss tend to draw a figure that represents themselves, or draw an idealized 
figure, or draw a figure with unfavorable characteristics. These concepts were rated 
on fifteen bipolar scales which were presumed to be relevant. The scales used in- 
cluded six representative of the evaluative factor (attracting-repelling, complete- 
incomplete, important-unimportant, healthy-sick, high-low, and sociable-unsociable), 
five for the potency factor (large-small, hard-soft, strong-weak, deep-shallow, and 
masculine-feminine), and four for the activity factor (active-passive, hot-cold, tense- 


relaxed, and aggressive-defensive). The same scales were used for each of the four 
concepts. 


Subjects. Forty five institutionalized women labelled as schizophrenics were Ss 
in the present study. The average age was 29.11 years, with a range of 18 to 38. 
The Ss were screened to determine that they were all sufficiently in contact and of 
adequate operating intelligence to understand and complete the tasks involved. 


1The author wishes to express his appreciation to Vasso Vassiliou for her assistance in the collec- 
tion of the data. 
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Procedure. Same-sex drawings of institutionalized women were used in this 
study. Each S was seen individually and was instructed to “draw a woman.”’ Follow- 
ing completion of the drawing each S rated her own drawing on the semantic scale. 
This was followed by ratings of the concepts IS, AS, and LLS, which were presented 
in counterbalanced order. 


RESULTS AND DiIscussION 


For each 8, the D values between the figure drawing and the concepts IS, AS, 
and LLS were determined by summing the squared differences over the fifteen scales 
and taking the square root of the sums. Rank order correlations between the figure 
drawing and each of the three concepts were also determined with results given in 
Table 1. 


1. Mean D-Scorzs (Upper VALUES) AND CoRRELATIONS (LOWER 
VaLvuEs) BETWEEN FicurE Drawinc JIS, AS, anv LLS. 


Is AS LLS 
9.63 8.68 11.04 
35 .59* .36 
‘Significant at the .01 level 


Figure Drawing 


The mean D for the figure drawing and AS is smaller than those for the other 
comparisons, and with a higher and significant correlation between the profiles. 
There is a greater semantic similarity between the figure drawing and AS than for 
the other comparisons. It appears, then, that the results of this investigation lend 
some support to the hypothesis that human figure drawings do represent the draw- 


er’s perception of himself. For our population, at least, the Ss tended to draw a 
figure that approximates their actual self much more than an idealized or unfavor- 
able one. 


SUMMARY 


The present study investigated the hypothesis that human figure drawings do 
represent the drawer’s perception of himself by utilizing the semantic differential 
technique. An attempt was also made to determine whether S tended to draw a figure 
that represented himself, or draw an idealized figure, or an unfavorable one. To do 
this, 45 hospitalized schizophrenic women drew same-sexed figures and then rated 
them together with the concepts My Ideal Self, My Actual Self, and My Least-Liked 
Self presented in counterbalanced order on a set of fifteen bipolar scales. 

The total Ss responded consistently with the hypothesis to a degree significantly 
greater than chance expectancy. The Ss tended to draw a figure that represented 
themselves much more than an ideal or unfavorable one. This was interpreted as 
supporting the hypothesis that human figure drawings do represent the drawer’s 
perception of himself. It was noted that while the hypothesis was supported, the 
extent to which such relationship could be accurately discerned by clinical observers 
remains unspecified. 
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PERFORMANCE DECREMENT ON THE H-T-P TEST 
AS A FUNCTION OF ADJUSTMENT LEVEL 
CHARLES V. LAIR AND E. PHILIP TRAPP 


Veterans Administration Hospital 
Fayetteville, Arkansas 


PROBLEM 

The present study tested the hypothesis that 
the deterioration in esthetic appeal from the 
House drawing to the Person drawing in the 
H-T-P. Test would be ter in maladjusted 
than in nonmaladjusted subjects. This is de- 
rived from a basic postulate that the two draw- 
ings tap a different level of personality, with the 
Person drawing operating more at the psycho- 
social level), and from Buck’s contention that 
the test itself is a progressively stressful task“). 


METHOD 

From a total of 87 H-T-P tests given as part 
of a test battery to patients from the VA Hospital 
at Fayetteville, Arkansas, two groups of 25 cach 
were selected. The maladjus group were sub- 
jects with a mixture of psychological and — 
complaints without clear-cut medical findings 
and a history of repeated hospitalization. The 
nonmaladjusted group were essentially first ad- 
missions with evident physical problems. The 
two groups were — with respect to age 
median 45 years), IQ (median 94), education 


status (median 8 grades), and occupational level 


(unskilled and semi-skilled). 

The subjects were matched in pairs across the 
two groups in terms of esthetic appeal of their 
House drawings by a faculty member from the 
art department of the University of Arkansas. 
To check on reliability, a second judge, also a 
faculty member of the art department of the 
University, independently matched pairs of 
House drawings. There was an 84 percent agree- 
ment between the judges. The Person drawings 
were then presented to the two judges for in- 
dependent evaluation. Judgements were based 
on global impr.ssion although the judges in- 
dicated that such specific factors as consistency 
in use of space concept, proportion, inventive- 
ness and imagination were involved. 


Resvutts anp Discussion 
Each judge selected 13 Person drawings from 
the maladjusted group and 12 Person drawings 
from the non-maladjusted group as reflecting 
most esthetic appeal. The percent of agreement 


between the two judges was 92. Hence, the results 
were clearly at chance level, and the hypothesis 
that the two groups would be differentiated on 
their Person drawings was not substantiated. 
This finding is in essential agreement with Sher- 
man’s®) results on patient and non-patient 


groups. 

Both judges commented that the Person draw- 
ings were generally inferior to the House draw- 
ings which supports the view that the Person is 
more difficult to draw. oreover, one j - 
ranked the 87 House drawings from whi 
final pairings were derived on a five-point scale 
of tic appeal as an incidental feature of this 
study. A definite trend was indicated with a high- 
er proportion of the maladjusted in the lower 
categories. This suggests some overall validity 
for the drawings. results suggest that 
drawings other than of a Person, e.g., House, 
Tree, etc., may be more clinically useful. Draw- 
ings that offer a better opportunity for elabora- 
tion and that offer less c nge to artistic ability 
may lead to less resistance and greater self ex- 
pression on the part of the subject. 


SuMMARY 

The hypothesis that more esthetic deteriora- 
tion from the House to the Person drawings on 
group com with a non- jus grou 
was tested. The drawings of 25 paired pel ten 
were rated on artistic quality with no differences 
found between groups. The implication of this 
result was briefly discussed. 
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THE BENDER GESTALT TEST FOR CHILDREN: 
A NORMATIVE STUDY 


ELIZABETH MUNSTERBERG KOPPITZ 
Endicott Public Schools, Endicott, N.Y. 


PROBLEM 

This paper presents a normative study of the Bender Gestalt Test) for young 
children. Recent studies have shown that the Bender Gestalt Test is especially 
useful as a developmental test of maturity in visual motor perception and coordina- 
tion. It is little influenced by emotional factors when used with first grade pupils © 
and appears to have considerable value both as a screening test for school beginners 
and as a test for diagnosing learning problems in the first through fourth grades». 
It was found that the Bender test can predict first grade achievement quite well and 
correlates also rather well with some Readiness Tests“: *». The B-G test may also 
prove useful as a non-verbal short intelligence test for young children; findings in- 
dicate a close relationship between the Bender test and both the WISC Performance 
Scale 1Q and the Arithmetic Subtest Score“. 


METHOD 


Subjects. The population consisted of 1055 school children representing 44 entire 
classes in 11 different schools located in rural, small town, suburban, and urban 
areas. Both colored and white children are included. Only pupils whose age was 
between 5 years and 10 years and 5 months were included in this study. The classes 
chosen were: 10 kindergarten classes, 13 first grades, 11 second grades, and 5 each 
of the third and fourth grades.' 


Procedure. The Bender Gestalt Test was administered individually to each S ac- 
cording to directions outlined by Bender“. Stimulus cards were presented one at a 
time and were removed as soon as the S had completed drawing a Figure. On Figure 
5 the Ss were discouraged from counting the dots as this is only time consuming and 
adds but little new information. AS who counts dots on Figure 5 will also count the 
dots on Figures 1 and 3. If however, a § persists on counting the dots despite in- 
structions to the contrary then it becomes diagnostically significant, suggesting 
perfectionistic or compulsive behavior. 

All Bender protocols were scored “blindly” by the author, that is, they were 
scored without any knowledge of the 8 other than the grade placement. Thereafter 
the protocols were arranged according to the Ss’ age, sex, and grade placement and 
the scores were tabulated. In addition, in 16 of the classes each 8 was timed from the 
moment he placed his pencil on the paper to begin drawing Figure A to the time he 
completed the last stroke on Figure 8. 

The scoring system used in this study was developed by the author especially 
for use with young children. It has been repeatedly cross validated since it was first 
published ®: *), The Revised Scoring System presented in Fig. 1 includes only items 
which differentiate consistently between the above and below average students in the 
first and second grades at the 5 per cent level of significance or better. Several 
changes were made from the original Scoring System. The Revised Scoring System 
consists of 30 scoring items none of which are mutually exclusive. This is a binary 
system, each item is scored as 1 when present and 0 when absent. Thus a child may 
obtain a maximum score of 30. Since the Bender is scored for errors only, a high 
score indicates a poor performance on the test, while a low score indicates a good one. 


'The writer gratefully acknowledges the help of Mrs. Jenny Raduege, David D. Blyth, John 
Sullivan, Joel Shelton and Dan Lutzger in obtaining part of the data. anks are also due to the 
school teachers and principals for their generous cooperation. 
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Fig. 1. Tue Revisep Scorina SysTeM FoR THE BenpER GesTaLt 


D1sTorTION OF SHAPE 


Figure A, 7 a. Parts of Figure excessively distorted or misshapen ; extra or missing angles. 
“ b. sNepengeeean between two parts of Figure, one must be at least twice the 
size of the other 


Figure 1, 3, 5 a. Five or more dots converted into circles; enlarged dots or partially filled in 
circles not scored. 
Figure 6 a. Three or more distinct angles eer wy ‘ed curves. 
b. No curves at all; one or two straight lin 


Figure 8 a. H extra or missing angles; 
diamond omitted. 


RoraTIon 


Figure A, 1, 2,3, a. Rotation of Figure or any part thereof by 45° or more; rotation of stimulus 
4, 5, 7, igen card if then copied in rotated position; wiatien of card and paper and then 
copied in correct position not scored. 


INTEGRATION 
Figure A, 4 a. Failure to join two parts of Figure; curve and adjacent corner more than 
1/8” apart; this applies also to overlap. 

Figure 2 a. One or two rows of circles omitted; row of dots of Figure 1 used as third row 
for Figure 2; four or more circles in majority of columns. 

Figure 3, 5 a. Shape of design lost; conglomeration of dots. 

b. Continuous lines instead of series of dots. 

Figure 6 a. Two lines not crossing or crossing at the extreme end of one or both lines, 
that is, less then 14” from end of the line; two lines interwoven horizontally. 

Figure 7 a. Hexagons do not overlap or overlap excessively, that is, one completel: 
penetrates through the other one. . iit 


a. More than 15 dots in a row. 
Figure 2 a. More than 14 columns in a row. 
a. Six or more sinusoidal curves in either direction. 


ReEsvtts AND Discussion 

Girls seem to mature a little earlier than boys do in visual motor function, yet 
none of the differences found between the mean Bender scores for boys and girls were 
statistically significant at any of the age levels tested. As would be expected of a 
developmental test the mean Bender scores decrease as the Ss get older, that is, the 
Bender improves consistently with age. Table 1 shows a marked drop in scores be- 
tween the.age of 5 and the age of 7. Thereafter the decrease is more gradual and 


TaBLe 1. DistTRIBUTION OF BENDER ScorEs BY AGE 


Age Group N Bender 8. D. +18. D. 


Score 
5-0 to 5-5 81 13.6 3.61 10.0 to 17.2 
5-6 to 5-11 128 9.8 3.72 6.1 to 13.5 
60 to 6-5 155 8.4 4.12 4.3 to 12.5 
6-6 to 6-11 180 6.4 3.76 2.6 to 10.2 
70 to 7-5 156 4.8 3.61 1.2to 8.4 
7-6 to 7-11 110 4.7 3.34 1.4to 8.0 
8-0 to 62 3.7 3.60 -lto 7.3 
86 to 811 60 2.5 3.03 Oto 5.5 
9-0 to 59 1.7 1.82 Oto 3.5 
9-6 to 9-11 45 1.4 1.43 Oto 2.8 
10-0 to 1 19 1.5 1.31 Oto 2.8 


PERSEVERATION 
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seems to level off at age 9 when sufficient maturity in visual motor perception has 
been acquired by most Ss to draw the Bender correctly without any distortions. 
Up to age 8 the Bender discriminates both those with outstanding visual motor 
perception and those with immature visual motor function. After age 8 the test no 
longer differentiates at the upper end; a score of 0 or the absence of errors falls within 
one standard deviation from the mean and indicates nothing beyond that the child 
has at least average visual motor perception. It may or may not be outstanding. 
Once the age has been reached at which the visual motor function has matured the 
Bender can no longer be used to screen out exceptionally capable children. On the 
other hand the Bender clearly differentiates those whose visual motor perception is 
below what would be normally expected on each of the age levels investigated in this 
study. Table 2 shows a close parallel between the mean Bender scores for each grade 


TaBLe 2. DisTRIBUTION OF BENDER ScorEs BY GRADES 


Grade N Mean Age Mean Bender 8. D. +18. D. 
Score of Bender 


Beginning Kindergarten 5-4 13.5 : 
Beginning Ist 6-5 8.1 4.14 
Beginning 2nd 7-5 4.7 3.18 
Beginning 3rd 8-7 2.2 2.03 
Beginning 4th 9-8 1.5 1.88 


level and the equivalent age level. When considering a child’s grade placement the 
Mean Grade Score may be more appropriate than the Mean Age Score. 

It was found that girls tend to be consistently a little faster than boys in com- 
pleting the Bender. However, no statistically significant differences were found be- 
tween the mean time required for this test by each sex and by the various age groups. 
The mean time for each age group varied between 6 minutes and 5 seconds to 7 
minutes with an overall average of 6 minutes and 30 seconds. The standard devia- 
tions for the age groups ranged from 1 minute 30 seconds to 3 minutes 30 seconds. 
Time only seems to be of importance if a child is very slow or very fast. A child who 
requires unusually long to finish the Bender may be extremely perfectionistic or he 
may reveal a problem in visual motor perception or coordination for which he can 
only compensate with much effort and time. Even if the final protocol is quite satis- 
factory these slow working children are likely to have difficulty in school since they 
cannot complete their work on time in class and especially on tests. On the other 
hand a child who dashes off the Bender test in a very short time tends to be impulsive, 
immature or negativistic. He shows a lack of concentration and/or effort to carry 
through the details required by this task. Only very rarely may an exceptionally 
bright youngster complete the Bender successfully in less than 4 minutes. Generally 
a very short time needed for the Bender is associated with poor test performance 
and with poor school achievement. The critical time limits for the Bender seem to be: 

Age 5-0 to 5-5 Time limit: 3 minutes to 10 minutes 

Age 5-6 to 5-11 Time limit: 4 minutes to 10 minutes 

Age 6-0 to 8-11 Time limit: 4 minutes to 9 minutes 

Age 9-0 to 9-11 Time limit: 4 minutes to 8 minutes 


SUMMARY 


This paper presents the Revised Scoring System for the Bender Gestalt Test 
for children together with normative data based on the performance of 1055 school 
children aged 5 to 10 years. Mean test scores and standard deviations are given for 
each age and grade level. Critical time limits for each group are also presented. 
Findings show that the mean Bender scores improve steadily up to age 9 and level 
off thereafter. The Bender differentiates both outstanding and immature visual 
motor perception up to age 8, thereafter it no longer discriminates at the upper end. 


9.9 to 17.1 
4.0 to 12.2 
1.5to 7.9 
2to 4.2 
Oto 3.4 
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The test continues to differentiate children with poor visual motor perception also at 
the higher age levels. No statistically significant differences were found between the 
mean Bender scores or mean time scores for boys and girls. 


REFERENCES 


1. Bernver, Laurerra. A Visual Motor Gestalt Test and its Clinical Use. New York: American 
Orthopsychiatric Association, 1938. 

2. Kopprrz, Evizaneru M. The Bender Gestalt Test and Learning Disturbances in Young Child- 
ren. J. clin Psychol., 1958, 14. 

3. Kopprrz, Etizasetu M. Relationships between the Bender Gestalt Test and the Wechsler In- 
telligence Scale for Children. J. clin. Psychol., 1958, 14. 

4. Kopprrz, M., Suuurvan, JoHN, Biyru, Davin D., and Prediction 
of First Grade School Achievement with the Bender Gestalt Test and Human Figure Drawings. 
J. clin. Psychol., 1959, 15, 164-1°8. 

5. Kopprrz, EvizaBETH M. Teacher’s Attitude and Children’s Performance on the Bender Gestalt 
Test and Human Figure Drawings. J. clin. Psychol., 1960, 16, 204-208. 

6. Kopprrz, EvizapetaH M., Marpis, VERDENA, and STEPHENS, THOMAS. Screening School Bogin- 
ners with the Bender Gestalt Test and the Lee Clark and the Metropolitan Readiness Tests. Un- 
published manuscript. 


A VALIDATION STUDY OF MEASURES OF FLATTENED AFFECT ON 
THE BENDER-GESTALT TEST 


WILLIAM M. PRADO,’ DOUGLAS A. R. PEYMAN*® AND OLIVER L. LACEY 
University of Alabama 


PROBLEM 


This study is an effort to validate the assertion that certain signs on the Bender- 
Gestalt Test (B-G) are indicators of ‘‘flattened affect” ®: +. ©. The term “flattened 
affect’’ pertains to what Noyes’? regards as emotional dulling, and what Henderson 
and Gillespie term apathy. They indicate that individuals with “flattened affect’”’ 
show an absence of emotional responsiveness and lack of feeling tone. Such emotions 
as grief, elation, anger, dejection, amusement, joy, etc. are absent from the in- 
dividual’s repertoire of responses. The condition of ‘‘flattened affect” is prominent 
in schizophrenia. 

Boring and Topper, Halpern“), and Hutt“ have mentioned that flattened 
curves and/or decreased angulation for B-G designs are indicators of ‘flattened 
affect’’. If this is so, then objective measurements of the B-G designs should indicate 
significant differences in curvature and angulation between groups of individuals 
diagnosed as having and not having ‘“‘flattened affect’. 


PROCEDURE 


Of the 120 S’s, 40 were flattened affect psychotics (FAP), 40 were non-flattened 
affect psychotics (NFAP), and 40 were firemen (F). Two control groups were used. 
The NFAP group served as the institutional control for the FAP group, and the F 
group was the non-institutional control for the NFAP and FAP groups. A definition 
of flattened affect was verbally presented to eight psychiatrists. Each psychiatrist 
diagnosed the patients in his charge with regard to his knowledge of the patient and 
with reference to the definition offered. Only those patients were selected for the 
FAP and NFAP groups where there was no indecision as to classification. 


1Now Clinical Psychologist at V. A. Rona. North Little Rock, Arkansas. 
2Now Chief Psychologist, Bryce Hospital, Tuscaloosa, Ala. 
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A review of the literature indicated that it was necessary to take into account 
several extraneous variables, which if not controlled may result in measurements 
reflecting the influence of variables other than that of flattened affect. The extran- 
eous variables considered were intelligence®’, education®, visual and motor 
difficulties“, brain damage“: age®, handedness“, and shock treatment‘: 
10, 11, 12) 

The 8’s were of both sexes and with education between one and four years of 
high school. Intelligence was assumed to be within the average range because of 
their education and not having failed to be promoted. Visual and motor functioning 
was unimpaired by such factors as cataracts, strabismus, tics, tremors, etc. They 
had not had syphilis, meningitis, encephalitis, brain tumors, lobotomy, arterioscler- 
osis, head injuries or other conditions which might have affected the central nervous 
system. Their ages ranged from 20 to 45, they were right-handed, and they had not 
had any form of shock treatment for at least four months. A white population was 
used because of availability and the lack of B-G data for a non-white group. 

The patients were administered the B-G within two days of being diagnosed as 
FAP or NFAP. The B-G designs were placed parallel to the top of the record sheet 
and reproduced free hand. The techniques of measurement used were formulated 
before any S’s protocol was examined. The protocols of the three groups were 
randomly mixed so that it was not possible to ascertain to which group any protocol 
belonged while the measurements were being made. 

Angulation measurements were obtained for designs 2, 3, 5,6, and 7. Curvature 
measurements were obtained for designs 4, 5 and 6. Several of Billingslea’ s®) factors 
of measurement were used. The angulation and curvature measurements are dia- 
grammatically represented in Figure 1. Scores less than those indicated were con- 
sidered a decrease in angulation and curvature. 


Fiaure 1. ANGULATION AND CURVATURE MEASUREMENTS FOR B-G Desicans 
ANGULATION 


2. MEAN ANGLE 
TURE 
er on s 


Angulation Measurements. Design 2 (mean angle A). The intersection of the line of best fit 
through the center of the top row circles (line ine AB) with the lines of best fit through the center s 
each gro’ re, three circles in the vertical plane (lines AC, DE, FG, etc.) forms angles BAC 
wet BFG, etc. The sum of these les divided by n represents mean angle A, which was 64 

This measurement corresponds to Billingslea’s factor of Sloping Angulation, Index 2E. 

Tide 2 (angle B). The distance from - center of the first circle on the Gove row (point A) 
pe ds sagpiy: Lager sheet was measured. A second point was marked equidistant from the top 
of the page on the other side of the record sheet. These two points define cline parallel to the to; 
of the page. The e formed between this line and line AB represented angle B. Any line AB 
above the horizontal was recorded by positive , degrees ; deviations below Marie negative degrees. 


measurement appears related to ’s factor of Directional I 

Design 3 (angles C, D, and E). Lines of best fit were drawn from points I . abt Gace: 
each group of dots on both sides. Angles C C, D, and E were respectively 55 , 53, and 53 53 degrees. 
Billingslea no reference to this measurement. 


Whe 
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Design 5 (angle F). A line of best fit (line LM) was drawn through the handle of the design 
and extended all the way through the a. Another line was drawn through the two end dots of 
the curve (line NO). The intersection of the extended line LM with line NO formed angle F, 
which was 48 degrees. This measurement corresponds to Billingslea’s factor of Sloping Angula- 
tion. 

Design 6 (angle G). Using a maximum point of the first horitontal wave on the left side 
(point P) as a fixed point, a line of best fit was drawn through all of the maximal points of the 
horizontal waves (line PQ). For the vertical waves, point K was the fixed point of the line of best 
fit to be used to join the maximal points of the vertical waves (Line RS). The intersection of 
lines PQ and RS at point T formed angle RTQ or angle G, which was 65 degrees. This measure- 
ment is congruent with Sloping Angulation. 

Design 7 (angle H). A line was extended through the two extreme lengthwise vertices of the 
vertical hexagon (line UV) and the other hexagon (line WX). The intersection of lines UV and 
WX formed angle H, of 36 degrees. This measurement is congruent with Sloping Angulation. 


Design 7 (angle I). The two end points for the right side of the left hexagon and the left side 
of the right hexagon were used as fixed points for drawing two intersecting lines. The angle 
— by these two intersecting lines (angle I) was 35 degrees. Sloping Angulation is involved 

ere. 


Curvature Measurements. Ail of the curvature measurements employed the same technique. 
A straight line was drawn connecting the minimal points of both ends of a curve (baseline). After 
selecting the maximum point on the curve, a perpendicular line was drawn to the base line. 
Midway along the perpendicular line another line was drawn parallel to the base line just to the 
sides of the curve. The ratio of half the perpendicular to the latter parallel line was the curva- 
ture measurement used, the units being in millimeters. This measurement corresponds to Billin 
lea’s factor of Shape Distortion. Instead of using his proposed method, utilizing the ratio of the 
height to the width of the curve, the particular ratio used here was considered to be a more 
sensitive measurement of the flatness of a curve. Billingslea’s height to width ratio does not dis- 
criminate between curves of different shapes (e.g., leptokurtic and platykurtic) having the same 
height and width, whereas, the ratio used in this study does. 

Design 4. AB is the baseline, D is the maximum point and DC is the perpendicular line. The 
ratio for this design was DE/GF and was .38 mm. 

Design 5. NI is the baseline, J is the maximum point, and JO is the perpendicular line. The 
ratio for this design was JM/KL and was .37 mm. 

Design 6. Each curve of the horizontal (H) and vertical (V) waves was measured separately. 
The mean curvatures core for each wave was computed in a similar fashion as indicated above. 
The mean H-wave measurement. was .28 mm and the mean V-wave measurement was .27 mm. 


RESULTS AND Discussion 


The mean angulation and curvature scores for the three groups are shown in 
Table 1 with an analysis of variance. 


TABLE 1. MBAN ANGULATION AND CURVATURE SCORES FOR THE FAP, NFAP anp 


Groups 
Angulation Score Group 
FAP NFAP F F Ratio 
Angle A 74 72 2.82 
B .93 AT 2.87 
Cc 52 56 47 
D 54 54 49 
E 59 51 
C+D+E 159 169 147 1.98 
F 52 48 47 2.70 
G 73 74 71 1.43 
H 35 34 36 
I 38 37 38 
H+I 73 71 74 6.39 
Curvature Score 
ign 4 .87 -40 40 1.19 
5 41 43 1.59 
.32 .33 29 
+V)  .59 62 54 2.44 


*None of the F ratios were significant, P > .05 
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The F ratios of Table 1 indicate that the mean angulation and the mean curva- 
ture scores of the FAP, NFAP, and F groups do not differ significantly from one 


another. The B-G has not, in fact, distinguished between flattened affect and non- 
flattened affect groups. 


SUMMARY 


Flattened curvature and/or decreased angulation on certain designs of the 
Bender-Gestalt Test have been suggested as indicators of ‘‘flattened affect’’. The 
present study was designed to investigate whether or not such a relationship exists. 

Three groups of subjects, each composed of forty individuals, were enlisted for 
this study. Two of the groups were from state mental hospitals, being psychiatrically 
diagnosed as psychotic and divided into a ‘flattened affect” or ‘“‘non-flattened 
affect” category. The third group consisted of firemen, regarded as the “normal”’ 
group. The record of each subject given the Bender-Gestalt Test was scored accord- 
ing to prescribed methods. No significant differences were found among the mean 
angulation and curvature scores of the three groups for any of the figures in this 
study. The Bender-Gestalt Test does not appear to distinguish adequately between 
“flattened affect” and ‘‘non-flattened affect” groups. 
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A COMPARISON OF BENDER GESTALT REPRODUCTIONS OF 
RIGHT AND LEFT HEMIPLEGIC PATIENTS! 


SAMUEL HIRSCHENFANG 
Kings County Hospital Center, Brooklyn, New York 


PROBLEM 

This study investigates the relationship be- 
tween on the Bender Gestalt) of 
two groups o ht and left hemiplegic patients. 
Tolor® a significant differences in re 
between organics, with and without convulsive 
disorders, and patients with non-psychotic 
disorders. Reznikoff and Olin“ 
attem to determine whether schizophrenia 
could “t differentiated from a group of organic 

tients in — ability to recall the figures of the 
Bender Gestal t. Karlin and Hirschenfang®) 
have a that left hemiplegics demonstrated 
more disturbance in visuospatial discrimination, 
in spite of little visual agnosia, as compared to 
right hemiplegic patients. The purpose of this 
study is to determine whether brain damage in 
the dominant or non-dominant hemisphere 
effects the performance on the Bender Gestalt, 
and to what extent. 


METHOD 


The Bender Gestalt was administered and 
scored employing Pascal] and Suttell’s“) method 


though there was no be- 
tween groups on -~ & the A Mann 
Whitney U Test, yielded a p Pe 32, using a two- 
tailed test of significance. 

To determine the significance between the 
total standard scores of the right and left hemi- 
plegics, a Mann Whitney U Test, yielded a 

p < .001 indicating that the left hemiplegics do 
Goutheantly poorer than the right hemiplegics. 


SuMMARY 


The Bender Gestalt Test was administered to 
two groups of 25 right and 25 left hemiplegic 
patients. The left hemiplegic patients did sig- 
nificantly poorer than the right hemiplegic 
patients. 
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1. Comparison or Riaut Lerr Hemrpiecic ScoRES ON THE BENDER GESTALT 


legic Left Hemiplegic 
(N= 25) 

Figure Mean 8.D. | Mean 8.D. r P 
1 6 6.60 10 9.11 75 .33 
2 8 6.52 12 9.32 55 .54 
3 10 9.0 15 9.49 .69 .40 
4 13 5.83 20 9.29 .73 .35 
5 8 4.70 12 6.77 52 .56 
6 12 15 8.38 .64 .48 
7 14 7.61 19 8.75 58 51 
8 13 5.51 17 6.60 71 37 
Configuration 8 4.71 14 7.65 42 36 
Total 128 40.98 175 43.53 76 <.001 


to two groups of 25 institutionalized hemi- 

plegic patients each, including 25 right hemi- 

Plegi cs, mean age 56 years, range 22-78 years 
25 left hein, mean age 60 years 


‘a 46-83 years. A two tailed test (p = 17) 


indicated that a difference between the ages of 
both groups is not The first grou 
consisted of right he ——_ patients with 
cerebral lesions in the dominant hemisphere. 
The second group consisted of left hemiplegic 

tients with cerebral lesions in the non-dom- 
inant hemisphere. All of the patients sustained 
a cerebrovascular stroke (due to thrombosis, 
embolism, or hemorrhage). 


REsULTs 
In Table 1, the group of left be pe c patients 
do less well than the grou piegicpe ee 
mean scores on 


3. Pascauy, G. and B. The Bender 
— Test. New York: Grune and Stratton, 
4. Reznikorr, M. and T. D. Recall of 
the Bender-Gestalt designs by organic and 
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PSYCHOLOGICAL RESPONSE TO CHLORPROMAZINE 
IN A GROUP OF PSYCHIATRIC PATIENTS! 


LONNIE E. MITCHELL AND MELVIN ZAX?” 
Veterans Administration Mental ‘. Clinic, University of Rochester 
Washington, D. C. 


PROBLEM 


Clinical observations suggest that the dramatic behavioral changes often 
brought about by ataractic drugs are the result of modifications of the affective 
state of the patient rather than personality reorganization. The purpose of this 
study is to assemble a battery of psychological tests with which to measure both some 
relatively superficial affective states as well as more enduring features of personality 


and to administer it to a group of hospital patients before and after a course of chlor- 
promazine treatment. 


METHOD 

A total of 69 psychiatric patients at St. Elizabeths Hospital, Washington, D. C. 
served as subjects (Ss). The characteristics of these Ss and the criteria for selecting 
them are reported elsewhere“. Ss were divided between an experimental group of 
36 and a control group of 33, equated for age, sex, length of hospitalization, and de- 
gree of anxiety as judged clinically by the referring psychiatrist. 

The test materials were three scales constructed to measure various aspects of 
anxiety plus two which are considered to measure more enduring personality traits. 
The measures of anxiety were Taylor’s Manifest Anxiety Scale ©’, the Winne Neuro- 
ticism scale“ and the N scale constructed by Guilford and Martin®?. The other 
two devices were Guilford’s S factor which is a measure of social introversion-extra- 
version and his R factor which involves fun-loving and freedom from care“. The 
Manifest Anxiety Scale, the Neuroticism Scale and the N scale were completed by 
all a Only 32 experimental and 26 control Ss were able to complete the S and R 
scales. 

All Ss selected to take part in the study were tested and retested approximately 
30 days later by one of the authors (L. M.). Following the first test, Ss were assigned 
by two collaborators to either the experimental or the control group on the basis of 
the matching variables. Experimental Ss were given chlorpromazine for a 30 day 
period with dosage and method of administration regulated by the psychiatrist in 
charge of the case. Typically, after a week or 10 days of gradual increase most Ss 
were receiving between 200 and 600 mg. (orally) daily. Control Ss, of course, re- 


ceived no ataractic drugs. 
RESULTS 


In order to establish that the groups were equated on test scores before the drug 
was administered, ‘‘t’” ratios were computed comparing mean scores of experimental 


TABLE 1. INTERCORRELATIONS OF PRE-TREATMENT TEsT SCORES 


Taylor Winne N 8 R 
Taylor Scale — .80 — .004 
Winne Scale —.70 .54 .02 
— .60 04 


R 


‘This paper was based on one read at the 67th meeting of the American Psychological Association 
in Cincinnati, Ohio, September 1959. 

2The authors are grateful to the staff and patients at St. Elizabeths Hospital, Washington, D. C. 
whose cooperation made this study possible. A special debt of gratitude to Dr. A. D. Annis for the 
part he played in planning and carrying out the project is also acknowledged 
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and control Ss on all five tests; all were insignificant. Intercorrelations between test 
scores were computed to determine the extent to which the tests measured separate 
functions, and are reported in Table 1. 

The highest correlations in Table 1 are among the three “anxiety” scales. At 
the same time, the S scale is substantially correlated with each of the anxiety scales 
while the R factor shows little relationship to them. This would suggest that if 
chlorpromazine does effect a significant change in anxiety scale scores, one might 
expect a concomittant change in the S factor since it is related to these scales. Mean 
differences between pre and post treatment scores were determined for each group 
and compared by means of “‘t’’ tests with results reported in Table 2. 


TasB.e 2. Tests oF SIGNIFICANCE OF THE DIFFERENCES BETWEEN PRE AND 
Post-TREATMENT Test SCORES FOR THE EXPERIMENTAL AND THE CONTROL 
Groups. NEGATIVE Signs DENOTE A LOWERING OF THE TEsT ScoRE. 


Scales & Groups Mean Variance t P 


Taylor Scale 
rimental —5.14 163.81 
Control —1.97 87.15 


Experimental —4.64 44.12 
Control P —0.85 28.50 


5.61 108.73 
0.79 50.16 


-2.56 108.35 
—2.27 65.32 


Experimental 32 3.66 99.99 
Control 26 1.92 60.25 0.74 N.S. 


*Cochran-Cox test —s to test for — of differences between two 
means when population variances differ “, 


The analyses indicate that after a 30-day regimen with chlorpromazine Ss show 
a significant drop in scores on the Winne scale and a significant increase in their 
scores on the N factor as compared to a control group. Both changes reflect a lowered 
anxiety .evel as measured by these particular tests. The Taylor scale difference 
scores likewise reflect a greater drop of anxiety level in experimental Ss as compared 
to controls but these differences are not significant. This appears to be largely a 
function of the fact that in the case of the Taylor difference scores the variability is 
quite high. The difference scores of both groups on the S scale are quite similar and 
insignificant. The slight differences found between groups on the R factor likewise 
fail to approach a significant level. 


Discussion 


Within the limitations set by the measures used, these findings support the oft- _ 
made clinical observation that the behavioral changes seen to follow treatment with 
chlorpromazine are primarily the result of a change in anxiety level and not profound 
modifications in life style. Changes in R factor scores were insignificant as were 
those of the S factor both of which probably measure more enduring features of 
personality than do the anxiety scales. Difference scores in the S factor were non- 
significant in spite of the fact that it measures some functions similar to those 
assessed by the anxiety scales. To account for the failure of this factor to reflect the 
same changes as the anxiety scales one must assume that test elements other than 


those related to anxiety per se, presumably introversive or extraversive traits, were 
unaffected by the drug. 


1.18 N.S. 
Winne Scale ‘ 

rimen 
Control 33 

rimen 
R Scale 


LONNIE E. MITCHELL AND MELVIN ZAX 


SUMMARY 


Two groups of patients, hospitalized for emotional disorders, were equated for 
age, sex, length of hospitalization and degree of anxiety as judged clinically by the 
referring psychiatrist. One group was treated with chlorpromazine for 30 days while 
the other, acting as a control, experienced routine hospital care but did not receive 
treatment with ataractics for an equivalent period. Both groups were tested before 
and after the 30-day period with a battery of five psychological tests. Three were 
essentially measures of anxiety while the other two attempted to assess personality 
traits of a more enduring nature. It was found that after chlorpromazine treatment 
there was a significant change in the expected direction in the scores of two of the 
three measures of anxiety. On the other hand, changes on the tests of more enduring 
personality characteristics were insignificant. 
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CHARACTERISTICS OF PSYCHIATRIC OUTPATIENTS 
RECEIVING TRANQUILIZERS 


MAURICE LORR,! DOUGLAS M. MC NAIR,! AND STANLEY B. RUSSELL” 
Veterans Administration, Washington, D. C. 


A survey was recently conducted in all Veterans Administration Mental Hy- 
giene Clinics to obtain data concerning the prevalence, judged benefits and effects 
of tranquilizer drugs in treating psychiatric out-patients. The present report repre- 
sents a study of limited aspects of the larger survey®. Specifically, the aims of the 
study were: (a) to compare clinic patients who received tranquilizers as the primary 
treatment with those who received psychotherapy as the primary treatment; (b) 
to ascertain the relation of social class level to treatment of out-patients; and (c) to 
summarize what physicians and psychotherapists consider the indications for and 
benefits of tranquilizers. 


METHOD 


Data Collection. Sixty-three clinics, widely distributed over the entire United States, 
were asked to describe representative samples of both open and closed cases on a 
Patient Treatment Record.’ First, all cases who terminated treatment for any 
reason during the month of September 1957 were described. Second, a random 
sample was taken of all cases open at the end of September 1957 who had received 


1From the Veterans Administration, Veterans Benefits Office, Washington, D. C. 

2From the Veterans Administration, Central Office, Washington, D. C. 

*Acknowledgement is made to Martin M. Katz, Psychopharmacology Service Center, National 
Institute of Mental Health, Bethesda, Md., for assistance in constructing the Patient Treatment 
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any treatment. Open cases were defined as those which were either active or in- 
active but not terminated. Beginning with the first case in the open file, each third 
case was included in the survey. 


The Treatment Record. The Patient Treatment Record included data concerning 
such patient variables as sex, age, highest level of schooling completed, primary and 
secondary clinic diagnosis, reported symptoms and complaints, length of usual treat- 
ment interview, and scheduled treatment frequency. The principal therapeutic pro- 
cedure and any adjunct treatment procedures were noted for each case. For all 
patients given medication, type of drug, daily dosage, number of weeks treated, and 
any side effects noted were indicated by the prescribing physician. Finally opinions 
were requested concerning the indications for drug treatment, the judged beneficial 
and adverse effects, and attitudes toward the therapeutic value of tranquilizer drugs. 


Method of Analysis. For purposes of this analysis each case was categorized by the 
principal planned therapeutic procedure as indicated on the Patient Treatment 
Record by the therapist working directly with a case. The treatment groups of 
interest here were the following: (a) individual or group psychotherapy as the princi- 
pal treatment procedure with no adjunct treatment; (b) individual or group psycho- 
therapy as the principal treatment procedure with tranquilizers as adjunct treat- 
ment; (c) tranquilizers as the principal treatment procedure with no adjunct treat- 
ment; (d) tranquilizers as the principal treatment procedure with individual or 
group psychotherapy as the adjunct treatment. Cases for whom the primary treat- 
ment procedure was neither psychotherapy nor tranquilizers were not included in the 
present analysis. The statistical treatment of the data consisted of x? and t-tests of 
significance as appropriate. 


RESULTS AND Discussion 


Treatment Group Differences. :-An examination of the four treatment groups with 
respect to age revealed no significant differences. For all groups the modal age is 37. 
However, as shown in Table 1, the treatment groups do differ as to the highest level 
of schooling completed. This variable was introduced as a crude index of social class. 
When psychotherapy is the principal treatment procedure, 50 per cent of the pa- 
tients are high school graduates or better. In contrast, the group who received tran- 
quilizers as the primary treatment are about equally distributed at all educational 
levels with high school graduates constituting only 30 per cent. The high school 
= thus, has a 50-50 chance of being assigned to individual or group psycho- 
therapy. 

The four treatment groups also differ significantly in percentages of cases diag- 
nosed by the clinic as neurotic, psychotic, psychosomatic, or as personality disorder. 
As may be seen from Table 1, the percentage of diagnosed neurotics is greatest when 
psychotherapy is the principal treatment method and least when tranquilizers are 
primary. The percentage of psychotics, on the other hand, tends to be greatest for 
the group treated principally with tranquilizers. There are no significant differences 
between the groups with respect to other diagnoses. 

Data on interview length in relation to type of treatment are presented in Table 
1. As might be expected, the length of the therapeutic interview is strikingly related 
to type of treatment. When psychotherapy is primary at least three-fourths of the 
patients receive therapeutic interviews over 30 minutes in length. When the thera- 
pist considers tranquilizers as the primary treatment, his interviews tend to be less 
than 30 minutes in length. The categories of treatment are also importantly related 
to scheduled treatment frequency as shown in Table 1. When psychotherapy is the 
principal treatment procedure, 50 per cent of the cases are scheduled for treatment 
once a week or more. In contrast, some 95 per cent of cases treated primarily with 
drugs are scheduled for treatment twice a month or less or on an irregular basis. An 
examination of the length of the patients’ most recent treatment course also reveals 
certain differences. When psychotherapy is primary, 45 per cent of such patients 
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have been in treatment for over a year. Only 32 per cent of the cases treated princi- 
pally with tranquilizers have been in treatment for more than a year. It may be that 
cases in which psychotherapy is primary are kept longer by their therapists or that 
they tend to remain longer than other patients. Another possible explanation is that 
patients did not receive tranquilizers prior to about 1953. 

It is worthy of note that when psychotherapy is the primary treatment, the 
differences between patients receiving or not receiving tranquilizers tend to be small 
or negligible. Seven per cent more patients receiving tranquilizers are diagnosed 
psychotic, four per cent fewer have interviews of 30 minutes or longer, and seven per 
cent fewer are scheduled for once a week or more treatment. There are no significant 
differences with respect to age, highest grade completed, or weeks in treatment. 
Since length of the treatment interview and scheduled treatment frequency are not 
patient characteristics, the diagnosis alone indicates how patients administered 
tranquilizers as an adjunct to psychotherapy differ from those whose treatment is 
restricted to psychotherapy. — 

To secure further information on possible differentiating variables, small 
sample comparisons were made comparing patients with and without drugs who were 
scheduled for psychotherapy once a week or more. It was found that patients who 
received drugs were rated as exhibiting a significantly greater number of anxiety 
symptoms on a 13-item checklist as compared to those who did not receive tran- 

uilizers. This finding is supported by therapists’ free comments indicating why 
p on therapy was initiated. Apprehension, anxiety, and tension were the most fre- 

uent reasons given. These results suggest that if psychotherapy is the treatment of 
choice, tranquilizers are prescribed to psychotic and excessively anxious patients. 


Social Class Differences. Hollingshead and Redlich“? have reported that social class 
level is strongly related to incidence of psychiatric disorder. Psychoses predominate 
in the lower social classes while neuroses are prominent in the middle and upper 
social classes. In general, psychotherapy is the treatment of choice for upper and 
middle class patients as contrasted with organic treatment for lower social class 
patients. Furthermore, middle and upper class patients remain in treatment longer. 
These findings suggested that the variables of length of the treatment interview, 
scheduled treatment frequency, and length of treatment course might relate to 
social class in the VA Mental Hygiene Clinics—as Winder and Hersko™? found in a 
study of the caseload of a single clinic. It also seemed likely that diagnosis and type 
of treatment would relate to social class. 

The measure of social class available in the survey was highest grade level 
completed. This variable, which has high correlations with other indices of social 
class, such as size of income and occupational level, was used as the social class index 
in the present report. As reported previously, type of treatment is substantially re- 
lated to educational level (Table 1). However, tests of significance showed that 
primary clinic diagnosis is not related to educational level in the present population. 
On the other hand, length of the treatment interview and scheduled treatment fre- 
quency are substantially related to.education. Only 27 per cent of high school grad- 
uates have treatment interviews of 15 minutes or less as compared to 73 percent of 
non-high school graduates. As far as scheduled treatment frequency is concerned, 62 
per cent of high school graduates are assigned to once weekly or more frequent treat- 
ment, while only 38 per cent of non-graduates are assigned to this treatment fre- 
quency. Evidently social class level, as measured by highest educational level com- 
pleted, is an important determiner of the type of treatment assigned to a patient, 
and consequently to his scheduled treatment frequency and the length of his inter- 
view. Diagnosis itself is unrelated to education. 

Is social class related to improvement? Therapists’ comments on the ways in 
which drugs benefited patients or therapeutic procedures were available on the 
Patient Treatment Record. These comments were categorized as indicative of im- 
provement, no change, worsening, or of mixed adverse and beneficial effects on a 
random sample of patients who were being seen in psychotherapy once a week or 
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more and who also were receiving tranquilizers. A comparison of a group judged to 
have improved with a comparable group who showed no change or became worse 
revealed interesting differences. The results indicated, at a significant level, that the 
lower the educational level the greater the judged therapeutic response to tranquil- 
izers. Possibly, then, lower social class patients respond better to organic treatment 
than middle class patients. 


Treatment Indications and Benefits. Some 13 types of tranquilizers were prescribed 
to clinic patients. Meprobamate and chlorpromazine were clearly the most common- 
ly used tranquilizers in the VA Mental Hygiene Clinics. Reserpine ranked a poor 
third with respect to usage. Whenever drugs were prescribed during the course of 
treatment, the therapist was asked to give the indications for initiation of such 
treatment. Apprehension, anxiety, and extreme tension were the most commonly 
reported indications for treatment. Next in frequency was hostility. Other indica- 
tions, usually considered related to or resulting from anxiety or tension, were head- 
aches, sleeplessness, and irritability. Some therapists reported use of tranquilizers to 
“develop a positive therapeutic relationship” and to “‘make the patient more acces- 
sible to psychotherapy.” 

The benefits reported by therapists usually related directly to the reasons for 
initiation of treatment. Illustrative of the benefits reported were the following: 
“relieved anxiety and apprehension,”’ ‘‘patient gained feeling of support from ther- 
apist,” “assured patient that the therapist was interested in him,” “better able to 
become involved in therapy,” ‘‘increased performance on the job,” “increased cap- 
acity for self-examination,” ‘reduced psychotic ideation,” “less irritable and less 
angry.” Except for side effects, very few adverse effects on the patient or the course 
of psychotherapy were reported. Some patients felt a lack of self-control. Therapists 
occasionally noted lethargy, or indicated that the patient no longer talked about his 
problem. 


SUMMARY AND CONCLUSIONS 
In summary, the results of our analysis of the survey data indicate the following: 


1. Compared with patients for whom psychotherapy is primary, patients 
whose principal planned therapeutic procedure consists only of tranquilizers are 
high school graduates significantly less frequently, and are diagnosed psychotic 
significantly more frequently. 

2. Patients whose principal planned therapeutic procedure is psychotherapy 
plus tranquilizers as adjunct treatment are significantly more frequently diagnosed 
psychotic, and are rated as significantly more anxious than patients who receive only 
psychotherapy. 

3. The higher the educational level of the patient the more likely that he will 
be scheduled for psychotherapy once a week or more, that his interview will be 30 
minutes or longer, and that he will be in treatment for over a year. 

4. Toa significant degree, patients who are in psychotherapy once a week or 
more and who receive tranquilizers are more likely to be judged improved if they 
have little educational background than if they are well educated. 

5. Therapists agree that anxiety, tension, and hostility are the indications 
used to initiate tranquilizer treatment. 
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EFFECT OF ATARACTIC DRUGS ON SUCCESS OF TRIAL VISITS OF 
LONG TERM PATIENTS 
ROBERT F. MUNSON, W. ARTHUR THEOBALD AND RICHARD M., GRIFFITH 
Veterans Administration Hospital, (Lexington, Ky.) and University of Kentucky 


The immediate effect of ataractic drugs within 
the mental hospital was an impressive improve- 
ment in the condition of some long term patients. 
Some of those, for whom little clinical hope had 
been held, were brought to the point of leaving 
the hospital. Did these patients maintain their 
apparent clinical gains or did they eventually 
return to the hospital? Ellsworth and Clayton “) 
report no significant difference between the 
course of hospitalization (length of stay or return 
rate) of two newly admitted groups of patients— 
one group admitted during the period of a year 
before the drugs were introduced, the second dur- 
ing a a when chemotherapy was used in- 
tensively. The present study, conducted in a 
comparable neuropsychiatric h 
evidence on the return rate of ¢ 
who received ataractic drugs. 

Introduced into the hospital in 1954, the 
ataractic drugs were first in he in May 
1955. From July, 1955 through June, 1956 7 
patients who pod been continually 
hospitalized for two years or er were rele 
on trial visit (TV) status. Fifty-six of these had 
spate some dosage of tranquilizing drugs (the 

); thirty-one, who had received none, 

y July, 1957 every TV had terminated—suc- 
cessfully, it was assumed, if the patient had been 
discharged, or unsuccessfully, if he had returned 
to the hospital. 


ital, furnishes 
‘onic patients 


The salieut who had not received drugs were _ 


more successful in the outcome of the TV’s. 
Fifty-seven percent of the Rx group returned to 
the hospital vs. 32 percent of the non-Rz. The 
difference is statistically significant by Chi Square 
at the .05 probability level. 


group corresponds closely to the figure cited by 
worth and Clayton“) for chronic patients 
and also, according to the records of the Social 
Work Service, to the expectations within this 
hospital. Thus, by all standards, the rate of re- 
lapse in the Rz group was excessively high. 

The results might have been different had the 
Rz patient continued hospital d of the 
tranquilizer while on TV. Tn the early days of 
chemotherapy regulations were cautious = 
scribing minimal dosages—usually a nell 
tion of the amount which had walt about the 
clinical improvement—when the patient was no 
longer under close medical supervision. Even 
this medication was discontinued by = 
patients when on their own. Or, pe ays 
patient had been more seriousl tur 
gin with than the non-Rz. owever, hg a 
proaches do not provide evidence for any differ- 
ence in the degree of disturbance: (a) The average 
length of had nearly the 
same for both groups (the Rz averaged 66 
months; the non-Rz 69 months—previous hos- 
pitalizations not hon considered). (b) Previous 
(unsuccessful) 7V’s had been gran to equal 
percents (41 vs. 42) of the two groups. Finally, 
to account for the lack of success of the 
patient selected for TV, one may suspect an early 
over-optimism in a new treatment modality 
which promised much, partly because it was 
needed so badly. 
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RELATIONS BETWEEN THE INTRACLASS CORRELATION AND THE 
KUDER-RICHARDSON RELIABILITY FORMULAS 


MAURICE LORR 
Veterans Benefits Office, Washington, D. C. 


The intraclass correlation coefficient based upon analysis of variance concepts 


is now widely used to secure estimates of inter-rater reliability. Ebel®) proposed 
use of this index and indicated its relation to other estimates of reliability such as 
the product-moment correlation, average inter-rater correlation, and Horst’s 
generalized reliability coefficient. Earlier Hoyt also developed an estimate of re- 
liability based on analysis of variance which he indicated was the same as formula 
# 20 of the Kuder-Richardson? series. It can be shown that Ebel’s intraclass for- 
mula for reliability of an average rating, Hoyt’s formula, and Kuder-Richardson 
(K-R) #20 are identical. While this relation has been demonstrated in several re- 
ports, no published study is available which presents more broadly the relations of 
the intraclass correlation to both of the K-R formulas #20 and #21, for continuous 
as well as for dichotomous variables. Furthermore, relatively few workers with and 
without a major interest in test theory seem aware of these relationships. For 
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example, Guilford’s text) refers to these procedures as alternative approaches to 
estimating reliability. 

To fill these gaps, this report summarizes some of the relations of intraclass r to 
K-R formulas for continuous as well as discontinuous variables, points out the role of 
observer error, and presents a series of simple computational formulas involving 
pone ga and sums of squares. Use of the formulas is illustrated with several ex- 
amples. 

Suppose N persons are rated by n raters or observers. Let X, be the rating 
given the i th person by the j th rater wherei = 1,2...N,andj = 1,2...n. Let 
X,. be the sum of ratings for person i over all raters; X., be the sum of ratings given 
by rater j to all individuals; and X.. be the sum of all ratings of all individuals by all 
raters. The “‘dot’’ indicates the variable summed—whether over individuals (i) or 
over raters (j). Whenever reference is made to raters and ratings, it should be under- 
stood that the argument applies equally well to test items and test scores. 


TaBLe 1. NovraTIon FoR ANALYSIS OF VARIANCE 


Variance 
Sim of squares df estimate 


N-1 


n-1 


(N-1) @-1) 


2 
Total: S nN Nn-1 


Table 1 represents a two-way analysis of variance table in which the rows repre- 
sent persons and the columns represent observers. Then the intraclass reliability for 
an average rating is given by 
= (1) 

where V, is the estimate of person variance and V, is the estimate of error variance. 
The reliability of an individual rating may be estimated by substituting r, and the 
value for n in the Spearman-Brown formula and solving for r,, the reliability for a 
single rater. 

When the equivalents of V, and V, from Table 1 are substituted in (1) and 
reduced, we have 

8, s, 
(2) 
n- 1 8, 

Note should be made, by reference to Table 1, that the observer sum of squares 

is excluded (subtracted) from the error sum of squares. 


= 


RELATION OF INTRACLASS Tr TO K-R #20 
If the scale variables are continuous, K-R #20 “ may be expressed as 
n op? — Zo? 


n-1 (3) 


Here a,” is the between person variance and Yo;*, the sum of rater variances. Now 
it can be shown, using the definitions given in Table 1, that o,? equals n/N S,, and 


T200 = 


Between 

persons: S, D nN iz Vp 
Between x? x? 

Residual or 
error: S, 8 = 8, -8, -8, 
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Zo,’ equals 8, -S,/N. If these equivalents are substituted in equation (3), the result 
is equation (2) for the intraclass r. Thus K-R #20c and the intraclass correlation 
in which the observer sum of squares is excluded are formally the same. 

When the scales are dichotomous and ratings are zero or one, an observer's 
variance becomes pq where p is the proportion of persons rated one and q is the 
proportion of persons rated zero. Thus Yo;? becomes Ypq in formula (3) which is 
then the familiar dichotomous form of K-R #* 20d. The use of this form for dicho- 
tomous judgments of the presence or absence of a trait seems not to have been sug- 
gested before. 

Tue Errect oF OBSERVER VARIANCE 


In Taste (2) differences among observers are excluded. As Ebel“) has in- 
dicated, if decisions are made by comparing averages of complete sets of ratings from 
all observers then observer differences should be removed from the error sum of 
squares. However, if decisions are made in practice by comparing single ratings 
assigned to different persons by different raters then the observer sum of squares is 
properly included in the error term. Similarly if only incomplete sets of ratings are 
available rater differences should be included. 

Since the between observer variance (or variance of observer variances) plays 
an important role in the estimation of reliability, it is of some interest to show its 
influence explicitly. This iuay be accomplished by a few substitutions in equation 
(2). The total sum of squares can easily be shown to equal nNo;,’ the total variance; 
and the between observer sum of squares to equal nNo,”, where a,” is the between 
observer variance; and as indicated earlier N /n o,* equals 8,. If these equivalents 
are substituted for the various sums of squares in (2), the resulting equation is 

— (neo? — no,*) 
= n-1 (4) 


If the ratings are dichotomous in form then ne,’ in equation (4) reduces to M — 
M?/n where M is the mean of all ratings in the table. A comparison of (4) with (3) 
shows that the sum of observer variances, 2o;*, actually consists of the total variance 
with the between observer variance excluded 

Now suppose the observer sum of squares is included in the error sum of squares, 
so that S, in Table 1 equals 8, -S,. The second term in the numerator of (2) then 
reduces to 8,/n. If S, in (2) is replaced by N /no,? and §,/n is replaced by No,’, 
this a for the intraclass correlation reduces to K-R ¥#21c for continuous 
variables 


n op — hoe 


= n- 1 oy (5) 


Since for dichotomous ratings ne,’ is equivalent to M — M?/n, equation (5) with this 
last substitution is the familiar K-R # 21d for dichotomous variables. 

Thus equations (4) and (5) are identical except for no,”, the between observer 
variance. When the between observer variance is zero, K-R *20 and #21 will be 
the same. Since the error term in #21 is larger than that of * 20, the former will 
always yield a smaller reliability coefficient except when the variance of observer 
variances is zero. 


Tue CoMPpuTATIONAL FORMULA 


Formulas for computing K-R #20 and #21 for continuous (c) and dichot- 
omous (d) variables are given below. Both utilize only sums and sums of squares. 
The first set is the same as those of Hoyt and Stunkard“™?. 


n X?.. NX.. + 
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— nX?.. — + 


nN>,X? — nX?.. 
r n — nX?.. — nNX.. + 
nN>,X4,. — nX?.. 


TaBLe 2. ANALYSIS oF Ratines FoR ConTINUOUS VARIABLE 


Rater 

Person 1 Xi, x? 
1 <3 3 6 36 2X? = 274 
2 1 2 9 ZXj = 586 
3 5 7 12 144 X? = 1,156 
4 4 3 7 49 zEXj = 142 
5 2 4 6 36 
Xj 15 19 34 = X.,, = 841 
225 361 = .766 


Tables 2 and 3 present two illustrative problems, one for dichotomous data and 
the other for continuous variables. It should be noted that in each instance K-R * 20 
yields the larger reliability coefficient. 


TaBLe 3. ANALYSIS OF RatTINGs FOR DicHoToMOUS VARIABLE 
Rater 


1 1 1 1 3 9 2x? 
2 1 0 1 2 4 = 22 
3 1 0 1 1 xX? = 64 
4 1 1 0 2 4 zzXj = = 8 
5 0 0 0 0 0 
Xj 3 3 2 = -462 
2 


Taid - 423 


‘ SUMMARY 
The Kuder-Richardson reliability formulas *20 and #21 have been shown 
to be identical to several forms of the intraclass correlation coefficient. Simple 
computational formulas for dichotomous and continuous variables involving only 
sums and sums of squares were presented, and — on several examples. The 
role and effects of observer (item) variance in the K-R formulas has been highlighted. 
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The present study stems from two previous 
research efforts“: *) dealing with the clinical 
identification of suspected auditory malingerers. 
This brief report deals with the reliability of 68 
MMPI items which were significant at the .05 
level in discriminating between malingerers and 
nonmalingerers. A large sample was selected to 
examine the reliability of the items. 


SAMPLE 

In the present investigation, 65 male suspected 
malingerers were matched with a control sample 
of 65 males on the basis of age and degree of 
organic hearing loss. The suspected malingerers 
showed abnormal threshold fluctuation (elevated 
thresholds 15 db or greater) and considerable 
erent response variability. They were 
carefully screen for psychiatric disorders, 
organicity, aphasia and aural pathology. Each 
oukbest had a hearing loss of sufficient degree to 
interfere with communication. The average 
binaural loss for 500, 1000 and 2000 cycles was 
30 = _ each group with a standard deviation of 
+1 


Ivem RELIABILITY 

An analysis of the responses on the 68 items in 
the preliminary malingering scale showed a 
marked reduction in the number of significant 
items. A total of 19 items was significant at the 
.05 level. The malingerers scored the following 
items as false: 30, 79, 187, 221, 285, 383, 429, 495, 
and 552. Items 11, 67, 157, 325, 344, 354, 356, 
363, 413, and 447 were scored as true by the 
malingerers. The Lawshe and Baker®) method 
for estimating the significance of differences be- 
tween percentages was used to identify the dis- 
criminating items. 

The next step involved the determination of a 
cut-off score for identifying individual cases. 


For the 65 control cases on the 19 items of the 
li 


scale, the mean and median 


FURTHER INVESTIGATION OF THE RELIABILITY OF AN MMPI 
SCALE FOR AUDITORY MALINGERERS 


FRANK KODMAN JR. AND ERNEST MC DANIEL 
University of Kentucky 


4.0, the mode 3.0 and the range 1-11. The sus- 
ted malingerers gave a mean and median of 
.0 and a mode of 9.0. The range was 2-13. A 
cut-off score was determined empirically which 
best identified the cases in each sample. When a 
range of 0-6 was used to identify the normal or 
nonmalingerer and a score of 7.0 or above was 
assigned to identify the suspected malingerer 
our set of items correctly identified 75 percent of 
the cases in the two samples. 


SUMMARY 


The present study was a cross-validation of 
the 68 item MMPI Scale for Auditory Malinger- 
ers. Sixty-five suspected malingerers, (selected 
on the basis of peor response reliability and a 
threshold elevation 15 db or greater on a battery 
of hearing tests), were matched with 65 hard of 
hearing nonmalingerers. The subjects were ad- 
ministered the 68 items of the unvalidated 
malingering scale. A total of 19 significant items 
remained in the scale as a result of the cross- 
validation. These items should be useful in 
clinical or experimental investigations of audi- 
tory malingerers. A cut-off score was determined 


which correctly identified a total of 75 percent of 


the cases in the two samples. 
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ESTIMATING WAIS IQ FROM SHIPLEY-HARTFORD SCORES: 
A CROSS-VALIDATION 


ARTHUR N. WIENS AND WILLIAM H. BANAKA 
Oregon State Hospital 


During the past year and one-half the present 
authors have studied the comparability of Shi 
ley-Hartford total scores and WAIS Full Scale 
IQ. Sines and Simmons“) have recently publish- 
ed their findings of a similar study. Our data now 
aftord an opportunity to cross-validate their re- 
sults. Sines and Simmons reported a correlation 
of .90 between Shipley-Hartford total scores and 
WAIS Full Scale IQ’s. From the regression equa- 
tion derived from the correlation, they ee 
a conversion table for obtaining WAIS Full 
Scale IQ equivalents from Shipley total scores. 

The subjects in the present study (N = 140) 
appear to be generally comparable to the Sines 
and Simmons subjects. They are patients at the 
Oregon State Hospital who were referred for 
psychological testing by psychiatric staft and 
residents during the one and one-half year period 
of this study. Each patient was given the full 
WAIS; in 117 cases this was on the same day that 
the Shipley was given and in the remainder of the 
cases within a day. The diagnostic categories 
were representative of all major classifications: 
psychosis (40), neurosis (9), personality disorders 
(79), mental deficiency (4), and chronic brain 
syndrome (8). There were 97 males and 43 fe- 


males. 
WAIS full scale IQ’s ranged from 67 to 128 


(mean 100.5, sigma 13.0). Shipley total scores 
(corrected for omitted vocabulary items) ranged 
from 13 to 68 (mean 45.0, sigma 13.6). The pro- 
duct-moment correlation between Shipley scores 
and the actual WAIS IQ’s was .80, significant at 
the .01 level. It was further established that the 
regression equation develo by Sines and 
Simmons has a high degree of accuracy when ap- 
plied to an independent sample of patients, This 
cross-validation correlation was .80. The re- 
gression equation and the conversion table de- 
rived from the present study corresponds closely 
to that poo by Sines and Simmons. They 
reported the following equation: WAIS IQ = 
.7463) (S-H total score) + 68.455. The equation 
erived from our group is: WAIS IQ = (.7676) 
(S-H total score) + 65.92. It can be concluded 
that the Shipley-Hartford scale may be used 
profitably in a hospital setting as a substitute for 
pal No when economy of administration is 
esi 
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EDITORIAL OPINION 


ZEN BUDDHISM AND CLINICAL PSYCHOLOGY 


It is a sobering experience to rediscover the wisdom of ancient ways of living 
when seen in relation to contemporary enthusiasm over our newest clinical dis- 
coveries which frequently are whittled down to size when revalued in terms of his- 
torical perspectives. Currently we are rediscovering the phenomenological contri- 
butions of Zen Buddhism tracing back to Hui-neng (637-713) who was concerned 
with the rigorous experiencing of real living and being. Zen Buddhism and other 
Eastern religion-philosophies primarily were concerned with self-psychology, with 
the most direct and transcendental experiencing of the here-and-now, with knowing 
all levels of experiencing up to the ultimate of which man is capable through Zen 
methods of communing and relating. 

Dr. Harold Kelman“ has made a penetrating comparison of the influence of 
different Western and Eastern mind structures on the predominant Western interest 
in rationalistic materialistic science in contrast with Eastern concern with direct 
subjective modes of experiencing. Kelman believes that communication blocks and 
divergent spheres of interests between Western and Eastern scholars result from 
Western noun-oriented and phonetic-alphabetic language which results in the typical 
Western rational-intellectual emphasis on the theoretically conceptualized or in- 
ferred factors in things in contrast with Eastern concern with establishing ‘‘immed- 
iate contact with the Real’ through the most intensive direct experiencing of being 
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and ising. Kelman’s hypothetical model of the symbolic spiral goes beyond the levels 
of abstracting concept of general semantics in clarifying all the levels at which exper- 
ience exists and is symbolized. Both Kelman’s symbolic spiral and general semantics 
theory are insistent, however, on emphasizing the primacy of immediate experiencing 
and being, = on striving to deal with them on as immediate low levels of abstract- 
ing as ible. 

at is the significance of all this for clinical psychology? First, there is the 
contrast between Zen concern with dealing with immediate experiencing and current 
clinical psychology viewed as a product of Western mind structure with its emphasis 
on hypothetical model building and high level conceptualizing. Zen Buddhism was 
wise enough to insist on dealing directly with the most primal kinds of experiencing, 
developing methods for maximizing being and experiencing and real living. Zen 
dealt directly with the raw materials of being and experiencing, and letting no higher 
level abstractions dilute the real essences. Zen masters were, in effect, real psycholo- 
gists dealing directly with the behaviors which are the subject matter of psychology. 
In contrast, Western psychologists have come to be conceptual intellectualizers, 
semantically sophisticated verbalizers and statistical-experimental objectivists who 
} frequently have never even personally experienced the things they are talking 
about. 

A second implication, powerfully supported by contemporary existential and 
henomenonological interests, is that all clinical psychologists from students to the 
ighest professors should accept the challenge of obtaining direct first-hand exper- 

ience with the things which Zen Buddhism (and other forms of existentialism and 
phenomenology) deals with. Kelman opens up fascinating vistas for the future de- 
velopment of clinical psychology and psychiatry in his discussions of the acts of 
communing and relating, not only in one’s own personal life but also in professional 
practice of psychodiagnosis, counseling and psychotherapy. These are things which 
standard curricula in clinical psychology have classically ignored. Indeed, the clinical 
psychologist who concerned himself directly with issues of being and experiencing 
and living has been classically suspect of failing to conform to the currently fashion- 
able Western mind structure determined clinical psychology. 

Finally, a revolution in clinical psychology would occur if clinicians were re- 
quired to deal with and symbolize their subject matter in terms of the most direct, 
low-level primary dimensions of experience such as have been delineated by Zen 
Buddhism or existentialism. Fortunately, this possibility is not as remote as many 
might think. First, clinical psychologists must personally experience and concern 
themselves with the behaviors at issue. Second, they must develop suitable opera- 
tional methods for objectifying what is occurring, so that the data may be preserved, 


communicated and reexperienced by any who wish to repeat the operational methods 
and observations. sue 
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ETHICAL RESPONSIBILITIES OF THE PROFESSION TO CLINICAL 
PSYCHOLOGY 


Ethical responsibility is a bilateral relationship, involving certain obligations 
on the part of both parties. The APA recently has reevaluated and greatly strength- 
ened its code of ethics, particularly in relation to the activities of applied psycholo- 
gists. Clinical psychologists are constantly reminded of their ethical duties both to 
the profession and to the public. We would also like to insist that academicians and 
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basic scientists have obligations to advance (and at least, not to hinder) the applica- 
tions of psychology. : 

The history of American psychology has been characterized by a continuing 
rift between basic science psychologists and applied psychologists. Several attempts 
have been made by applied and clinical psychologists, as in the organization of the 
AAAP, to establish their own professional organizations with powers comparable to 
APA responsibilities. In professional psychology, in contrast to professional medi- 
cine, control of the parent APA has remained vested in the hands of academicians 
with workers in the field exerting relatively little influence. This situation often 
works to the detriment of applied and clinical psychology which cannot always 
secure needed cooperation from their academic and basic science colleagues. 

One manifestation of the situation is reflected in the tendency of academicians 
not to join and support the activities of state and local organizations which are pre- 
dominantly applied in their orientation. The academicians protest that they are not 
interested in problems of the development of applied clinical psychology and that 
they are not interested in participating on local levels. This attitude may create 
difficult professional problems as in smaller states where the top academic personnel 
refuses to be identified with the state psychological association, and thus creating 
the public impression that they do not support its activities in the direction of 
licensure, improvement of working conditions, securing grants-in-aid, etc. In our 
opinion, the academicians have as much of an ethical obligation to support clinical 
activities as the clinicians have the obligation to support basic science. 

Another manifestation may occur within university psychology departments 
where basic science and applied divisions often experience great difficulty in working 
together and even may require separate departmental organizations. In many of 
the larger universities, psychology is taught under the auspices of several separate 
departments which may teach different curricula and have relatively little to do with 
each other. This may create great difficulties for students of clinical psychology who 
have first to get by the academicians before they can enter their chosen field. Here 
a unilateral situation exists in which a clinical student is required to demonstrate 
experimental-statistical research proficiency in order to qualify for advanced clinical 
training, but experimental students are not required to demonstrate clinical pro- 
ficiency as a precondition for advanced experimental-statistical training. A para- 
doxical situation exists in which clinicians are also expected to be well rounded 
statistical-experimental researchers, while experimentalists may call themselves 
psychologists without ever having demonstrated any clinical or applied proficiencies. 
There are many promising clinical students, possessing native high psychological 
intelligence who may not be skilled researchers, and such students should not be dis- 
qualified because they cannot pass the academicians. Clinical psychology must have 
the same right to choose its own students as the experimentalist-statisticians have to 
choose their students. 

While it may be argued that the dichotomy of pure scientist vs. applied scientist 
is an artificial one, and that the ideal psychologist combines both types of training 
and ability, in practice we rarely find this ideal combination. There are many appli- 
cations of psychology which do not require high experimental-statistical ability 
which appears to be much rarer than administrative and clinical skills required of 
the applied psychologist. 

What we are trying to advocate is that all psychologists have an ethical obliga- 
tion to support both basic science and applied psychology. Applied psychology too 
long has been regarded as sort of a necessary evil or fifth wheel, to be tolerated but 
not actively supported. 


F. C. T. 
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TWO BOOKS OF UNUSUAL INTEREST 
AND IMPORTANCE ... 


BOOKS 


RORSCHACH PSYCHOLOGY 


Edited by MARIA A. RICKERS-OV- 
SIANKINA, University of Connecticut. Di- 
rected to theory-minded readers from the ranks 
of clinicians, general psychologists, and students 
alike, Rorschach Psychology is the first book 
that views the Rorschach method not merely as 
the vehicle of diagnostic or descriptive informa- 
tion about human personality, but primarily as 
a problem in general psychological thinking. 
Research findings and Guvenstcal postulates 
developed in other areas of psychology are em- 
ployed in the exploration of the intrinsic mean- 
ing of Rorschach variables. Conversely, the 
reader is introduced to the numerous ways in 
which the Rorschach, if properly understood, 
may enrich the broader field of psychological 
knowledge. 


Rorschach issues that have stimulated basic 
research have been selected for exploration in 
individual chapters of the book. Each chapter 
is handled by an author for whom the particular 
topic constitutes a major research interest. An 
introductory chapter supplies a brief theoreti- 
cally oriented synopsis of the method and 
assigns the place of every chapter topic within 
the over-all frame. 

Rorschach Psychology endeavors to enhance 
the conceptual precision of the basic text var- 
iables, to detect and delineate gaps in our 
knowledge, and to point up pertinent research 
areas. It thus lays the groundwork for further 
fruitful interaction between theory and practice 
in the complex task of developing this most 
challenging field of psychological investigation. 
1960. Approx. 480 pages. Prob. $8.50. 


DRUGS AND BEHAVIOR 


Edited by LEONARD UHR and JAMES G. 
MILLER, both of University of Michigan. This 
book deals with a problem which is of para- 
mount importance in present-day drug re- 
search: the effects of psychoactive drugs 
(tranquilizers, energizers, psychotomimetics, 
etc.) on human behavior and experience. In 
view of the social and ethical as well as the 
medical and psychiatric dimensions of the 
problem, it is essential that methods be de- 
veloped for the reliable and accurate assessment 
of a drug’s effects. Drugs and Behavior empha- 
sizes objective behavioral study involving 
human subjects as the means for drug evalua- 
tion. It is in a sense a “progress report’’ on the 
search for more precise experimental methods 
in the delineation of the behavioral toxicity, 
therapeutic effects, and action of psychoactive 
drugs. 


The papers included in the book are of two 
general types: articles discussing general theo- 
retical aa methodological issues involved in 
drug research, and reports on specific researches 
conducted by the contributors. Thus, the book 
not only reviews and summarizes important 
work already done in psychopharmacology, it 
also sets forth the various conceptual and 
methodological frameworks which are pertinent 
to the study of psychoactive drugs. In addition, 
several of the articles suggest and describe new 
methods and techniques which promise to be of 
great value in future psychopharmacological 
research. Also, the general index offers a brief 
description of the action and known effects of 
each drug mentioned in the book. 1960. Ap- 
prox. 608 pages. Prob. $12.00. 


SEND FOR EXAMINATION COPIES. 


JOHN WILEY & SONS, Inc. 
440 Park Avenue South, New York 16, New York. 


NEW 1960 BOOKS 
for the 


CLINICAL 
PSYCHOLOGIST 


PSYCHOLOGICAL AND PSYCHIA- 
TRIC ASPECTS OF SPEECH AND 


HEARING. Edited by Dominick A. Barbara, 
Karen Horney Clinic, N. Y. C. Through the com- 
bined efforts of twenty-five well-known specialists 
this significant work became a reality—the first and 
only comprehensive book on the subject. To insure 
a broad, eclectic viewpoint contributors were pooled 
from disciplines including chiatry, 8, ther- 
apy, spe teaching, psycho neurology, and 
social work. The result is a sound, sensible volume 
on the psychological processes involved in the 
origin, comprehension, and treatment of speech, 
hearing, and communication disorders. Pub. Oct., 
"60, 756 pp., $19.50. 


EARLY IDENTIFICATION OF EMO- 
TIONALLY HANDICAPPED CHILD- 
REN IN SCHOOL. By Eli M. Bower, Cali- 
fornia State i. of Education, Sacramento. Based 
on research with over 40,000 children, this book pre- 
sents the conceptual underpinnings of the term 
“emotionally handicapped” and the findings of re- 
search. Shows how this systematic, economical, and 
effective screening procedure can be employed by 
teachers and schools without a disproportionate 
amount of extra time or effort. Pub. Sept., ’60, 136 
pp., 24 il., (Amer. Lec. Psychology), $5.50. 


THE PREDICTION OF OVERT BE- 
HAVIOR THROUGH THE USE OF 
PROJECTIVE TECHNIQUES. By Arthur 
C. Carr, Columbia Univ.; Bertram R. Forer, Univ. 
of Calif.; William E. Henry, Univ. of Chicago; 
velyn Hooker, Univ. of Calif.; Max L. Hutt, 
Univ. of Michigan; and Zygmunt A. Piotrowski, 
Jefferson Med. Coll. of Philadelphia. The principles 
for inferring overt behavior from fantasy and other 
—— materials are formulated by experts in the 
ld of clinical psychology and LP testing. 
Whether approached as a study of the prediction of 
overt behavior, as a guide to projective test inter- 
pretation, or as an elucidation of the possible etiol 
and psychodynamics of overt sexual psychopa 
ology, this volume represents an impressive contri- 
bution to clinical theory and practice. Pub. Oc., 
"60, 192 pp., 23 il., (Amer. Lec. Psychology), $6.75. 


THE PERSON SYMBOL IN CLINICAL 


MEDICINE: A Correlation of Picture Draw- 
ings With Structural Lesions of the Brain. 3 
Robert Cohn, U.S. Naval Hospital Md. 


, Bethesda, 

The principal plan of presentation is to indicate the 
physiological substrate and some of the psycho- 
philosophical mechanisms whereby man “knows,” 
and the indices whereby this acquisition of know- 
ledge becomes manifest. This basic formulation is 
followed by the systematic and controlled —— 
tion of the development of a person symbol and the 
distortion and dissolution of a person symbol asso- 
ciated with disease processes of the brain. A precise 
reference for the relation of picture drawings and 
structural lesions of the brain for all investigators of 
the formulation of a “body image concept.” Pub. 
Oct., 60, 272 pp. (7 x 10), 70 il. 


ON PSYCHOTHERAPY OF SCHIZO- 


PHRENIA. By Albert Schefien, Temple Univ. 
The therapist himself can go but so far in his descrip- 
tion of what he does. The present work is a skilled 
and dispassionate study of Dr. John N. Rosen and 
his “Direct Analysis” method. It is a service to 
psychiatry to have such a prolonged study of one 
man’s treatment of schizophrenia. Such an account 
gets closer to the truth of what goes on when doctor 
and patient meet and change is produced. Pub. Dec., 
’60, (Amer. Lec. 


( 


THE PHYSIOLOGY OF EMOTIONS. 
Edited by Alexander Simon, Univ. California, 
Charles C. Herbert, Kaiser Foundation H 
San Francisco, and Ruth Straus, Kaiser Fou 
tion Hosp., Oakland. This exchange of hypotheses, 
data, and comment between world acknowledged 
authorities provides a feasible basis for an approach 
to the problems presented by human emotion. Cur- 
rent knowledge of the centers and pathways that 
control the emotions is reviewed. Observations of 
the physiologic responses to emotions, the biochemi- 
cal processes involved in emotional states, and the 
sites of action of err F products are 
summarized. Pub. Nov. ’60, about 233 pp. 


Send for our new 1960-61 catalog 
of over 1250 titles. 


CHARLES C THOMAS - PUBLISHER 
301-327 East Lawrence Avenue 
Springfield, Illinois 


Researcu Desicn AssociATES 
(A Division of the Journal of Clinical Psychology) 


The Editorial Staff of the Journal of Clinical Psychology is now organized 
to provide professional consultation services on all aspects of research design, 
statistical evaluation, data processing, editorial criticism and publication 
problems. Specialized consultants now are available in the areas listed below. 

Project ORIGINATION RESEARCH PLANNING 
EXPERIMENTAL DeEsIGN Data ProcessinGc 
Statistica, Desicn STATISTICAL EVALUATION 
CriticaL EvALuaTion Data ANALYSIS 


EprroriaA CRITICISM PuBLICcATION PRoBLEMS 


These services are offered in connection with research problems in any of 
the behavior sciences. Our consultants are particularly competent in relation 


to clinical problems in clinical psychology, psychiatry and the applied social 
sciences. 


These services will be available at all stages of research design, data 
collection and data evaluation. Please consult us if you desire an independent 
evaluation of research designs or work in progress. 


Consultations may be arranged either by mail or by a direct visit to your 
installation. Where indicated, several consultants will render opinions con- 
cerning various aspects of the project. Gratis estimates of costs may be ob- 
tained by sending an outline of the problems involved together with a state- 


ment of the services desired. For further information please write to the 
address given below. 


RESEARCH DESIGN ASSOCIATES 
JOURNAL OF CLINICAL PSYCHOLOGY 
5 Pearl Street 
Brandon, Vermont 


a) 


“The best in tests 


Could you use the new, 1960 edition 
of this well-known test for brain in- 
jury and perceptual-motor coordina- 
tion, tested and proven in clinical use 
and in research since 1945, with new 
norms based on large clinical and con- 


trol groups? 


Graham-Kendall 


MEMORY 
FOR 
DESIGNS 
TEST 


Then order Tester’s Set consisting of 
durable plates and 1960 manual 
($8.50), or just the 1960 manual 
($2.50) if you already have plates. 


Obtain further information by writing 


PSYCHOLOGICAL TEST 
SPECIALISTS 


Box 1441 
Missoula, Montana 


NEW EDITION 


2 
4, 
me 


DEVEREUX SERVES 
THE ATYPICAL CHILD 


SINCE ITS FOUNDING almost fifty years ago, the followin | 
types of children have benefited most from the multidisciplin 
rehabilitation program at Devereux Schools: 

1. Emotionally disturbed children of normal or superior in- 
telligence who have difficulty adjusting to the conventional school 
setting. 

2. Pre-psychotic and psychotic children at all levels of 
intellectual capacity who can benefit oom intensive individual 
therapy. 

3. Children with specific ohuntionst disabilities, such as 
aphasia or reading, visual, speech, or auditory monennge. 

4. Children who have brain injuries. 

5. Educable mentally retarded children with or without 
emotional difficulties. 


Exhaustive pre-enrollment evaluations of each child determine 
his placement in one of the twenty-two separate school-units 
located in Pennsylvania, California, and Texas. Highly in- 
dividualized, academic, commercial, and vocational programs 
are conducted under skilled supervision. 


CLINICAL STAFF 
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Clifford Scott, M.D. 
Aurelio Buonanno, M, 
Charles M. M.D. 
lema: 


Fred J. Cu 
Ruth E. M.D 
William F 


THE DEVEREUX FOUNDATION COMMUNITIES 

A nonprofit organization Founded 1912 | camps 
Devon, Penneylsania TRAINING 

Santa Barbara, California Victoria, Texas | RESEARCH 


HELENA T. DEVEREUX Professional inquiries for Eastern 


dministrative Consultant Schools should be directed to 
4 Cuarugs J. Fo Director of 


ools, Dev- 


EDWARD L. FRENCH, Ph.D. 
Director 


Treasurer in Texas, Box 336, Victoria, Tex. 


F. Ellsworth S.T.D. 
Milton Brutten, PhD. 
William J. Cohen, Ph.D. 
Dorothy E. Conrad, Ph.D. 
Sidney L. Copel, Ed.D. : 
A M, Jahneon, Ph.D. 
Robert L. Hunt, M.D. Noel Jenkin, Ph.D. 
Richard H. Lambert, M.D. John R. Kleiser, Ph.D. 
Leonardo Magran, M.D. Ph.D. 
Sumy J. Peters, M.D. Henry tt, Ph.D. 
Jacob S. Sherson, M.D. Edgar A. Smith, Ed.D. 
Albert S. Terzian, M.D. George Spine. Ph.D. 
Kenneth E. Evans, B.S. 
Psychoanalytic Consultants 
RS G. Henry Katz, M.D. Herbert H. Herskovitz, M. D. 
to Kerra A. Seaton, Registrar, 
Devereux Schools in 


